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(A) LENGTH: 3200 base pairs 
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(ii) MOLECULE TYPE:\cDNA 

(ix) FEATURE: 

(A) NAM^/aKEY: CodYng Sequence 

(B) LOCATION: 205. \. 2987 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRI PTIONy: SEQ ID NO : 1 : 

CAATGTCGAA GAGAGCACTA ACCAGGAAAAX TTGCGATTTC ATAAACCCAC TTTATTATTA 60 

AATTCTTACT TGCGTCATAT AAAATAGAAA VcTCAGAGAG TCAAGATAAA AATTCTTGAC 12 0 

AGCTGTTTTG TCATCTTTAA CTTGATTTAC IJTATTTTGTT TCTATATTGA TGCGAATAGT 18 0 

TCTCTAAAAA ACAAAAGCAT TACC ATG AAgXaCT TCG ATT CCT TGG GTT TTA 231 

Met Lys Vhr Ser lie Pro Trp Val Leu 
1 \ 5 



GTT TCC TCC GTG TTA GCT TTC TCA TGT C\C CTA CAG TCA CTA GCT AAC 
Val Ser Ser Val Leu Ala Phe Ser Cys Hi>s Leu Gin Ser Leu Ala Asn 
10 15 \ 20 25 



279 



GAG GAA CTT TTA TCA CCT GAT GAT AGC TTT ^T GGA AAT ATC GAT TCA 
Glu Glu Leu Leu Ser Pro Asp .Asp Ser Phe Aen Gly Asn lie Asp Ser 
30 35 \ 40 



327 



GGA ACG TTT ACT CCA AAA ACT TCA GCC ACA ACA TAT TCT CTA ACA GGA 
Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr ThrWyr Ser Leu Thr Gly 
45 50 \ 55 



375 



GAT GTC TTC TTT TAC GAG CCT GGA AAA GGC ACT dCC TTA TCT GAC AGT 
Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pho Leu Ser Asp Ser 
60 ,65 \ 70 



423 



TGT TTT AAG CAA ACC ACG GAC AAT CTT ACC TTC TTGXGGG AAC GGT CAT 
Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu yly Asn Gly His 
75 80 85 



471 



AGC TTA ACG TTT GGC TTT ATA GAT GCT GGC ACT CAT GQA GGT GCT GCT 
Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His Alk Gly Ala Ala 
90 95 100 \ 105 



519 



GCA TCT ACA ACA GCA AAT AAG AAT CTT ACC TTC TCA GGG\ TTT TCC TTA 
Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser Gly I Phe Ser Leu 
110 115 \ 120 



567 



CTG AGT TTT GAT TCC TCT CCT AGC ACA ACG GTT ACT ACA GGT CAG GGA 
Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr Thr Gly Gin Gly 
125 130 ]\35 



615 



ACG CTT TCC TCA GCA GGA GGC GTA AAT TTA GAA AAT ATT CGT AAA CTT 
Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn lie Ar^ Lys Leu 
140 145 150 



663 



GTA GTT GCT GGG AAT TTT TCT ACT GCA GAT GGT GGA GCT ATC ^AA GGA 
Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly Ala lie \ys Gly 
155 160 165 



711 
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GCG TCT TTC 
Ala Ser Phe 
170 

AAC TCT TCA 
Asn Ser Ser 



CGC ATA GCA 
Arg lie Ala 



TCT ACG TCA 
Ser Thr Ser 
220 

AAC AAC AAA 
Asn Asn Lys 
235 

GGT GCG ATC 
Gly Ala lie 
250 

TCT AAC AAT 
Ser Asn Asn 



GGT GGC GCC 
Gly Gly Ala 



ACA GAG TTT 
Thr Glu Phe 
300 

GCT ATC AGC 
Ala lie Ser 
315 

GGA AAC ATT 
Gly Asn lie 
330 

GAT ACT CCT 
Asp Thr Pro 



ACG GAA TTA 
Thr Glu Leu 



ATC ACT TCA 
lie Thr Ser 
380 



CTT TTA ACT 
Leu Leu Thr 
175 

TCA ACA AAG 
Ser Thr Lys 
190 

AAT AAC ACA 
Asn Asn Thr 
205 

GGA GGC GCT 
Gly Gly Ala 



TTT CTA TAT 
Phe Leu Tyr 



TGC AAC ACC 
Cys Asn Thr. 

255' 

AAG ACT CTG 
Lys Thr Leu 
270 

ATC CAT GCT 
lie His Ala 
285 

CTA CGA AAT 
Leu Arg Asn 



ATC GAT GCC 
lie Asp Ala 



ACC TTT GTA 
Thr Phe Val 
335 

AAA CGT AAT 
Lys Arg Asn 
350 

CGG GCT GCT 
Arg Ala Ala 
365 

GAA GGA ACC 
Glu Gly Thr 



GGC ACT TCT 
Gly Thr Ser 



GGA GGA GCA 
Gly Gly Ala 



GGT TAT GTT 
Gly Tyr Val 
210 

ATC GAT GAT 
lie Asp Asp 
225 

TTT GAA GGG 
Phe Glu Gly 
240 

AAG GCG AGT 
Lys Ala Ser 



ATC TTT GCT 
lie Phe Ala 



AAA AAG CTA 
Lys Lys Leu 
290 

AAT GTC TCA 
Asn Val Ser 
305 

TCA GGA GAG 
Ser Gly Glu 
320 

AGA AAT ACC 
Arg Asn Thr 



GCG ATC AAC 
Ala lie Asn 



AAA AAT CAT 
Lys Asn His 
370 

TCA TCA GAC 
Ser Ser Asp 
385 



GGA GAT GCT 
Gly Asp Ala 
180 

ATT GCT ACT 
lie Ala Thr 
195 

AGA TTC CTA 
Arg Phe Leu 



GAA GGC ACG 
Glu Gly Thr 



AAT GCA GCG 
Asn Ala Ala 
245 

GGA TCT CCT 
Gly Ser Pro 
260 

TCA AAC GTA 
Ser Asn Val 
275 

GCC CTT TCC 
Ala Leu Ser 



TCA GCA ACT 
Ser Ala Thr 



CTC AGT CTT 
Leu Ser Leu 

.325 

CTT ACA ACA 
Leu Thr Thr 
340 

ATA GGA AGT 
lie Gly Ser 
355 

ACA ATT TTC 
Thr lie Phe 



GTA TTG AAG 
Val Leu Lys 



CTT TTT AGT 
Leu Phe Ser 



ACA GCA GGC 
Thr Ala Gly 
200 

TCT AAC ATA 
Ser Asn lie 
215 

TCG ATA CTA 
Ser lie Leu 
230 

AAA ACT ACT 
Lys Thr Thr 



GAA CTG ATA 
Glu Leu lie 



GCA GAA ACA 
Ala Glu Thr 
280 

TCT GGA GGC 
Ser Gly Gly 
295 

CCT AAG GGG 
Pro Lys Gly 
310 

TCT GCA GAG 
Ser Ala Glu 



ACC GGA AGT 
Thr Gly Ser 



AAC GGG AAA 
Asn Gly Lys 
360 

TTC TAT GAT 
Phe Tyr Asp 
375 

ATA AAT AAC 
lie Asn Asn 
390 



AAC 7 59 

Asn 

185 

GCT 8 07 

Ala 



GCG 855 
Ala 



TCG 903 
Ser 



GGC 951 
Gly 



ATC 999 

He 

265 

AGC 1047 
Ser 



TTT 1095 
Phe 



GGT 1143 
Gly 



ACA 1191 
Thr 



ACC 1239 

Thr 

345 

TTC 1287 
Phe 



CCC 1335 
Pro 



GGC 1383 
Gly 
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TCT GCG GGA 
Ser Ala Gly 
395 

GAA ACC CTA 
Glu Thr Leu 
410 

TCA TTC ACG 
Ser Phe Thr 



AAG GGA GTC 
Lys Gly Val 

CTC CTC GGC 
Leu Leu Gly 
460 

ATT ACA ATC 
lie Thr lie 
475 

CAG CCC GTC 
Gin Pro Val 
490 

TCT GGG AAG 
Ser Gly Lys 



CAT ATG TTC 
His Met Phe 



GAT GCT GAT 
Asp Ala Asp 
540 

CCT GCT GAG 
Pro Ala Glu 
555 

GTT AAT TGG 
Val Asn Trp 
570 

ACT TGG ACC 
Thr Trp Thr 



TTA GTA TGC 
Leu Val Cys 



GCT CTC AAT 
7U.a Leu Asn 



ACA GCA GAT 
Thr Ala Asp 
415 

CAG CCA GTC 
Gin Pro Val 
430 

ACT TTA GAG 
Thr Leu Glu 
445 

ATG GAT TCA 
Met Asp Ser 



ACG AAC CTA 
Thr Asn Leu 



AGC CTA ACA 
Ser Leu Thr 
495 

CTC AAC CTG 
Leu Asn Leu 
510 

AGC CAT GAC 
Ser His Asp 
525 

GTT GAT ACT 
Val Asp Thr 



GAT CCT AAT 
Asp Pro Asn 



ACT ACG GAT 
Thr Thr Asp 
575 

AAA ACA GGA 
Lys Thr Gly 
590 

AAT ACC CTA 
Asn Thr Leu 
605 



CCA TAT CAA 
Pro Tyr Gin 
400 

GAA CTT AAA 
Glu Leu Lys 



TCC CTA TCC 
Ser Leu Ser 



AGC ACG AGC 
Ser Thr Ser 
450 

GGA ACG ACA 
Gly Thr Thr 
465 

GGA ATC AAT 
Gly lie Asn 
480 

GCA AAA GGT 
Ala Lys Gly 



ATT GAT ATT 
lie Asp lie 

CAG CTC TTC 
Gin Leu Phe 
530 

AAC GTT GAC 
Asn Val Asp 
545 

TCA GAA TAC 
Ser Glu Tyr 
560 

ACA GCT ACA 
Thr Ala Thr 



TTT GTT CCC 
Phe Val Pro 



TGG GGA GTC 
Trp Gly Val 
610 



GGA ACG ATT 
Gly Thr lie 
405 

GTT GCT GAC 
Val Ala Asp 
420 

GGA GGA AAG 
Gly Gly Lys 
435 

TTC TCT CT^A 
Phe Ser Gin 



TTA TCA ACT 
Leu Ser Thr 



GTT GAC TCC 
Val Asp Ser 
485 

GCT TCA AAT 
Ala Ser Asn 
500 

GAA GGG AAC 
Glu Gly Asn 
515 

TCT CTA TTA 
Ser Leu Leu 



ATC AGC AGC 
lie Ser Ser 



GGA TTC CAA 
Gly Phe Gin 
565 

AAT ACA AAA 
Asn Thr Lys 
580 

AGC CCC GAA 
Ser Pro Glu 
595 

TTT ACT GAC 
Phe Thr Asp 



CTA TTT TCT 
Leu Phe Ser 



AAT TTA AAA 
Asn Leu Lys 

TTA TTG CTA 
Leu Leu Leu 
440 

GAG GCC GGT 
Glu Ala Gly 
455 

ACA GCT GGG 
Thr Ala Gly 
470 

TTA GGT CTT 
Leu Gly Leu 



7WV GTG ATC 
Lys Val lie 



ATT TAT GAA 
lie Tyr Glu 
520 

AAA ATC ACG 
Lys lie Thr 
535 

CTT ATC CCT 
Leu lie Pro 
550 

GGA CAA TGG 
Gly Gin Trp 



GAG GCC ACG 
Glu Ala Thr 



AGA AAA TCT 
Arg Lys Ser 
600 

ATT CGC TCT 
lie Arg Ser 
615 



GGA 14 31 
Gly 



TCT 1479 

Ser 

425 

CAA 1527 
Gin 



TCT 1575 
Ser 



AGT 162 3 

Ser 



AAG 1671 
Lys 



GTA 1719 

Val 

505 

AGT 17 67 

Ser 



GTT 1815 
Val 



GTT 1863 
Val 



AAT 1911 
Asn 



GCA 1959 

Ala' 

585 

GCG 2007 
Ala 

CTG 2055 
Leu 
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CAA CAG CTT 
Gin Gin Leu 
620 

TTC TGG GTT 
Phe Trp Val 
635 

AAT CGC AAA 
Asn Arg Lys 
650 

AGT GCT CAC 
Ser Ala His 



CTC TTT GCT 
Leu Phe Ala 



ACC TAG GGT 
Thr Tyr Gly 
' 700 

C7\A AAC TAT 
Gin Asn Tyr 
715 

GAA AAA TTC 
Glu Lys Phe 
730 

TTC AGC CAT 
Phe Ser His 



GAA TCC GAA 
Glu Ser Glu 



CTA GAC CTT 
Leu Asp Leu 
780 

TTC ATT CCA 
Phe lie Pro 
795 

TTC TTC GAA 
Phe Phe Glu 
810 

CTT AAC CTC 
Leu Asn Leu 



GTA GAG ATC 
Val Glu lie 



TCC TCC ATG 
Ser Ser Met 



GGC TTC CGT 
Gly Phe Arg 
655 

ACT CCT AAA 
Thr Pro Lys 
670 

AGA GAC AAA 
Arg Asp Lys 
685 

GGA ACT TTA 
Gly Thr Leu 

TTG AGA TTA 
Leu Arg Leu 



CCT AGG GAA 
Pro Arg Glu 
735 

TCA GAC AAC 
Ser Asp Asn 
750 

GGT TCT TGG 
Gly Ser Trp 
765 

CCT TTT GTT 
Pro Phe Val 



CAG ATG AAA 
Gin Met Lys 



AGC TCT AGT 
Ser Ser Ser 
815 

TCG ATT CCT 
Ser lie Pro 
830 



GGC GCA ACT 
Gly Ala Thr 
625 

ACG AAC TTC 
Thr Asn Phe 
640 

CAT ACC TCT 
His Thr Ser 



GAC GAC CTA 
Asp Asp Leu 



GAT TGT TTT 
Asp Cys Phe 
690 

TTC TTC AAG 
Phe Phe Lys 
705 

GGA AGA GCA 
Gly Arg Ala 
720 

ATT CCC CTA 
lie Pro Leu 



CGT ATG GAA 
Arg Met Glu 



AGC AAC GAG 
Ser Asn Glu 
770 

CTT TCC AAC 
Leu Ser Asn 
785 

GTC GAA ATG 
Val Glu Met 
800 

GAT GGC CGT 
Asp Gly Arg 



GTG GGT GCG 
Val Gly Ala 



GGT ATG GAA 
Gly Met Glu 



CTG CAT AAG 
Leu His Lys 
645 

GGA GGC TAC 
Gly Gly Tyr 
660 

TTT ACC TTT 
Phe Thr Phe 
675 

ATC GCT CAC 
lie Ala His 



CAC TCT CAT 
His Ser His 



AAG TTT TCT 
Lys Phe Ser 
725 

GCC TTG GAT 
Ala Leu Asp 
740 

ACG CAC TAT 
Thr His Tyr 
755 

TGT ATA GCT 
Cys lie Ala 



CCA CAT CCT 
Pro His Pro 



GTT TAT GTA 
Val Tyr Val 
805 

GGT TTT AGT 
Gly Phe Ser 
820 

AAA TTC GTG 
Lys Phe Val 
835 



CAC AAA CAA 
His Lys Gin 
630 

ACT GGA GAT 
Thr Gly Asp 



GTC ATC GGT 
Val lie Gly 



GCG TTC TGC 
Ala Phe Cys 
680 

AAC AAC TCT 
Asn Asn Ser 
695 

ACC CTA CAA 
Thr Leu Gin 
710 

GAA TCA GCT 
Glu Ser Ala 



GTC CAA GTT 
Val Gin Val 



ACC TCA TTG 
Thr Ser Leu 
760 

GGT GGT ATC 
Gly Gly lie 
775 

CTT TTC AAG 
Leu Phe Lys 
790 

TCA CAA AAT 
Ser Gin Asn 



ATT GGA AGG 
lie Gly Arg 

CAG GGG GAT 
Gin Gly Asp 
840 



GGT 2103 
Gly 



GAA 2151 
Glu 



GGA 219 9 

Gly 

665 

CAT 22 4 7 

His 



AGA 22 9 5 

Arg 



CCC 2343 
Pro 



ATA 2391 
lie 



TCG 2439 

Ser 

745 

CCA 2487 
Pro 



GGC 2535 
Gly 



ACC 2583 
Thr 



AGC 2 631 

Ser 



CTG 2 67 9 

Leu 

825 

ATC 2727 
He 



40 



(!>( 



GGA GAT TCC mc ACC TAT GAT CTC TCA GGA TTC TTT GTT TCC GAT GTC 
Gly Asp Ser Tyx Thr Tyr Asp Leu Ser Gly Phe Phe Val Ser Asp Val 
845^ 850 855 
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TAT CGT AAC AAT 
Tyr Arg Asn Asn 
860 



CAA TCT ACA GCG ACT CTT GTG ATG AGC CCA GAC 
:o Gin Ser Thr Ala Thr Leu Val Met Ser Pro Asp 
865 870 
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TCT TGG AAA ATT CGG GGT GGC AAT CTT TCA AGA CAG GCA TTT TTA CTG 
Ser Trp Lys lie Arg\ Gly Gly Asn Leu Ser Arg Gin Ala Phe Leu Leu 
875 \ 880 885 
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AGG GGT AGC AAC AAC ISAC GTC TAC AAC TCC AAT TGT GAG CTC TTC GGA 
Arg Gly Ser Asn Asn TVr Val Tyr Asn Ser Asn Cys Glu Leu Phe Gly 
890 89^ 900 905 



2919 



CAT TAC GCT ATG GAA CTCXCGT GGA TCT TCA AGG AAC TAC AAT GTA GAT 
His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn Tyr Asn Val Asp 
910 \ 915 920 



2967 



GTT GGT ACC AAA CTC CGA TT\ CTAGATTGCT AAAACTCCCT AGTTCTTCTA GGGAG 
Val Gly Thr Lys Leu Arg Ph^ 
925 



3022 



TTTTCTCATA CTTTTAGGGA TVATATTT^CT ATAGGGAATG CTTTCCTTGC AAACTGTAAA 30 82 
AAATAACATT TGTCCCTCTT CAAAAAAQAT TTCTTTTAAT AATTTCTAGT TATAATTTTA 3142 
TTTTAAAAAC AGTTAAATAA TTAATAGAOIA ATAATCTATT CTTATTGACT TCTTTTTT 32 00 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTIC^ 

(A) LENGTH: 92 8 amino ac\ds 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
*{v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID r^O : 2 : 

Met Lys Thr Ser lie Pro Trp Val Leu Val S\r Ser Val Leu Ala Phe 

15 

Ser Cys His Leu Gin Ser Leu Ala Asn Glu GliA Leu Leu Ser Pro Asp 
20 25 \ 30 

Asp Ser Phe Asn Gly Asn lie Asp Ser Gly Thr the Thr Pro Lys Thr 
35 40 \ 45 



Ser Ala Thr Thr Tyr Ser Leu Thr Gly Asp Val 
50 



Phe Tyr Glu Pro 



Gly Lys Gly Thr Pro Leu Ser Asp Ser Cys Phe LysXGln Thr Thr Asp 



65 



80 
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Asn Leu Thr Phe 



Asp Ala Gly Thr 
100 

Asn Leu Thr Phe 
115 

Ser Thr Thr Val 
130 

Val Asn Leu Glu 
145 

Thr Ala Asp Gly 



Thr Ser Gly Asp 
180 

Gly Ala lie Ala 
195 

Tyr Val Arg Phe 
210 

Asp Asp Glu Gly 
225 

Glu Gly Asn Ala 



Ala Ser Gly Ser 
260 

Phe Ala Ser Asn 
275 

Lys Leu Ala Leu 
290 

Val Ser Ser Ala 
305 

Gly Glu Leu Ser 



Asn Thr Leu Thr 
340 

lie Asn lie Gly 
355 

Asn His Thr lie 
370 

Ser Asp Val Leu 
385 



Leu Gly Asn Gly 
85 

His Ala Gly Ala 

Ser Gly Phe Ser 
120 

Thr Thr Gly Gin 
135 

Asn lie Arg Lys 
150 

Gly Ala lie Lys 
165 

Ala Leu Phe Ser 



Thr Thr Ala Gly 
200 

Leu Ser Asn He 
215 

Thr Ser He Leu 
230 

Ala Lys Thr Thr 
245 

Pro Glu Leu He 



Val Ala Glu Thr 
280 

Ser Ser Gly Gly 
295 

Thr Pro Lys Gly 
310 

Leu Ser Ala Glu 
325 

Thr Thr Gly Ser 



Ser Asn Gly Lys 
360 

Phe Phe Tyr Asp 
375 

Lys He Asn Asn 
390 



His Ser Leu Thr 
90 

Ala Ala Ser Thr 
105 

Leu Leu Ser Phe 



Gly Thr Leu Ser 
140 

Leu Val Val Ala 
155 

Gly Ala Ser Phe 
170 

Asn Asn Ser Ser 
185 

Ala Arg He Ala 



Ala Ser Thr Ser 
220 

Ser Asn Asn Lys 
235 

Gly Gly Ala He 
250 

He Ser Asn Asn 
265 

Ser Gly Gly Ala 



Phe Thr Glu Phe 
300 

Gly Ala He Ser 
315 

Thr Gly Asn He 
330 

Thr Asp Thr Pro 
345 

Phe Thr Glu Leu 



Pro He Thr Ser 
380 

Gly Ser Ala Gly 
395 



Phe Gly Phe He 
95 

Thr Ala Asn Lys 
110 

Asp Ser Ser Pro 
125 

Ser Ala Gly Gly 



Gly Asn Phe Ser 
160 

Leu Leu Thr Gly 
175 

Ser Thr Lys Gly 
190 

Asn Asn Thr Gly 
205 

Gly Gly Ala He 



Phe Leu Tyr Phe 
240 

Cys Asn Thr Lys 
255 

Lys Thr Leu He 
270 

He His Ala Lys 
285 

Leu Arg Asn Asn 



He Asp Ala Ser 
320 

Thr Phe Val Arg 
335 

Lys Arg Asn Ala 
350 

Arg Ala Ala Lys 
365 

Glu Gly Thr Ser 



Ala Leu Asn Pro 
400 



42 



Tyr Gin Gly Thr lie Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu 
405 410 415 

Leu Lys Val Ala Asp Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser 
420 425 430 

Leu Ser Gly Gly Lys Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser 
435 440 445 

Thr Ser Phe Ser Gin Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly 
450 455 460 

Thr Thr Leu Ser Thr Thr Ala Gly Ser lie Thr lie Thr Asn Leu Gly 
465 470 475 480 

lie Asn Val Asp Ser Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala 
485 490 495 

Lys Gly Ala Ser Asn Lys Val lie Val Ser Gly Lys Leu Asn Leu lie 
500 505 510 

Asp lie Glu Gly Asn lie Tyr Glu Ser His Met Phe Ser His Asp Gin 
515 520 525 

Leu Phe Ser Leu Leu Lys lie Thr Val Asp Ala Asp Val Asp Thr Asn 
530 535 540 

Val Asp lie Ser Ser Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser 
545 550 555 560 

Glu Tyr Gly Phe Gin Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr 
565 570 . 575 

Ala Thr Asn Thr Lys Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe 
580 585 590 

Val Pro Ser Pro Glu Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp 
595 600 605 

Gly Val Phe Thr Asp lie Arg Ser Leu Gin Gin Leu Val Glu lie Gly 
610 615 620 

Ala Thr Gly Met Glu His Lys Gin Gly Phe Trp Val Ser Ser Met Thr 
625 630 635 640 

Asn Phe Leu His Lys Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His 
645 650 655 

Thr Ser Gly Gly Tyr Val lie Gly Gly Ser Ala His Thr Pro Lys Asp 
660 665 670 

Asp Leu Phe Thr Phe Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp 
675 680 685 



Cys Phe lie Ala His Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe 
690 695 700 



m 
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Phe Lys His Ser 
705 

Arg Ala Lys Phe 



Pro Leu Ala Leu 
740 

Met Glu Thr His 
755 

Asn Glu Cys lie 
770 

Ser Asn Pro His 
785 

Glu Met Val Tyr 

Gly Arg Gly Phe 
820 

Gly Ala Lys Phe 
835 

Leu Ser Gly Phe 
850 

Thr Ala Thr Leu 
865 

Asn Leu Ser Arg 



Tyr Asn Ser Asn 
900 

Gly Ser Ser Arg 
915 



His Thr Leu Gin 
710 

Ser Glu Ser Ala 
725 

Asp Val Gin Val 



Tyr Thr Ser Leu 
760 

Ala Gly Gly lie 
775 

Pro Leu Phe Lys 
790 

Val Ser Gin Asn 
805 

Ser lie Gly Arg 



Val Gin Gly Asp 
840 

Phe Val Ser Asp 
855 

Val Met Ser Pro 
870 

Gin Ala Phe Leu 
885 

Cys Glu Leu Phe 



Asn Tyr Asn Val 
920 



Pro Gin Asn Tyr 
715 

lie Glu Lys Phe 
730 

Ser Phe Ser His 
745 

Pro Glu Ser Glu 



Gly Leu Asp Leu 
780 

Thr Phe lie Pro 
795 

Ser Phe Phe Glu 
810 

Leu Leu Asn Leu 
825 

lie Gly Asp Ser 



Val Tyr Arg Asn 
860 

Asp Ser Trp Lys 
875 

Leu Arg Gly Ser 
890 

Gly His Tyr Ala 
905 

Asp Val Gly Thr 



Leu Arg Leu Gly 
720 

Pro Arg Glu lie 
735 

Ser Asp Asn Arg 
750 

Gly Ser Trp Ser 
765 

Pro Phe Val Leu 



Gin Met Lys Val 
800 

Ser Ser Ser Asp 
815 

Ser lie Pro Val 
830 

Tyr Thr Tyr Asp 
845 

Asn Pro Gin Ser 



lie Arg Gly Gly 
880 

Asn Asn Tyr Val 
895 

Met Glu Leu Arg 
910 

Lys Leu Arg Phe 
925 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGAAATCGC AATTTTCCTG GTTAGTGCTC TCTTCGACAT TGGCATGTTT TACTAGTTGT 60 

TCCACTGTTT TTGCTGCAAC TGCTGAAAAT ATAGGCCCCT CTGATAGCTT TGACGGAAGT 120 

ACTAACACAG GCACCTATAC TCCTAAAAAT ACGACTACTG GAATAGACTA TACTCTGACA 180 

GGAGATATAA CTCTGCAAAA CCTTGGGGAT TCGGCAGCTT TAACGAAGGG TTGTTTTTCT 240 
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GACACTACGG AATCTTTAAG CTTTGCCGGT AAGGGGTACT CACTTTCTTT TTTAAATATT 30 0 

AAGTCTAGTG CTGAAGGCGC AGCACTTTCT GTTACAACTG ATAAAAATCT GTCGCTAACA 360 

GGATTTTCGA GTCTTACTTT CTTAGCGGCC CCATCATCGG TAATCACAAC CCCCTCAGGA 42 0 

AAAGGTGCAG TTAAATGTGG AGGGGATCTT ACATTTGATA ACAATGGAAC TATTTTATTT 4 80 

AAACAAGATT ACTGTGAGGA AAATGGCGGA GCCATTTCTA CCAAGAATCT TTCTTTGAAA 54 0 

AACAGCACGG GATCGATTTC TTTTGAAGGG AATAAATCGA GCGCAACAGG GAAAAAAGGT 60 0 

GGGGCTATTT GTGCTACTGG TACTGTAGAT ATTACAAATA ATACGGCTCC TACCCTCTTC 660 

TCGAACAATA TTGCTGAAGC TGCAGGTGGA GCTATAAATA GCACAGGAAA CTGTACAATT 72 0 

ACAGGGAATA CGTCTCTTGT ATTTTCTGAA AATAGTGTGA CAGCGACCGC AGGAAATGGA 7 80 

GGAGCTCTTT CTGGAGATGC CGATGTTACC ATATCTGGGA ATCAGAGTGT AACTTTCTCA 8 40 

GGAAACCAAG CTGTAGCTAA TGGCGGAGCG ATTTATGCTA AGAAGCTTAC ACTGGCTTCC 900 

GGGGGGGGGG GGGGTATCTC CTTTTCTAAC AATATAGTCC AAGGTACCAC TGCAGGTAAT 9 60 

GGTGGAGCCA- TTTCTATACT GGCAGCTGGA GAGTGTAGTC TTTCAGCAGA AGCAGGGGAC 102 0 

ATTACCTTCA ATGGGAATGC CATTGTTGCA ACTACACCAC AAACTACAAA AAGAAATTCT 108 0 

ATTGACATAG GATCTACTGC AAAGATCACG AATTTACGTG CAATATCTGG GCATAGCATC 114 0 

TTTTTCTACG ATCCGATTAC TGCTAATACG GCTGCGGATT CTACAGATAC TTTAAATCTC 12 00 

AATAAGGCTG ATGCAGGTAA TAGTACAGAT TATAGTGGGT CGATTGTTTT TTCTGGTGAA 12 60 

AAGCTCTCTG AAGATGAAGC AAAAGTTGCA GACAACCTCA CTTCTACGCT GAAGCAGCCT 132 0 

GTAACTCTAA CTGCAGGAAA TTTAGTACTT AAACGTGGTG TCACTCTCGA TACGAAAGGC 13 8 0 

TTTACTCAGA CCGCGGGTTC CTCTGTTATT ATGGATGCGG GCACAACGTT AAAAGCAAGT 14 4 0 

ACAGAGGAGG TCACTTTAAC AGGTCTTTCC ATTCCTGTAG ACTCTTTAGG CGAGGGTAAG 15 00 

AAAGTTGTAA TTGCTGCTTC TGCAGCAAGT 7WWVTGTAG CCCTTAGTGG TCCGATTCTT 15 60 

CTTTTGGATA ACCAAGGGAA TGCTTATGAA AATCACGACT TAGGAAAAAC TCAAGACTTT 162 0 

TCATTTGTGC AGCTCTCTGC TCTGGGTACT GCAACAACTA CAGATGTTCC AGCGGTTCCT 168 0 

ACAGTAGCAA CTCCTACGCA CTATGGGTAT CAAGGTACTT GGGGAATGAC TTGGGTTGAT 17 4 0 

GATACCGCAA GCACTCCAAA GACTAAGACA GCGACATTAG CTTGGACCAA TACAGGCTAC 18 00 

CTTCCGAATC CTGAGCGTCA AGGACCTTTA GTTCCTAATA GCCTTTGGGG ATCTTTTTCA 18 60 

GACATCCAAG CGATTCAAGG TGTCATAGAG AGAAGTGCTT TGACTCTTTG TTCAGATCGA 192 0 

GGCTTCTGGG CTGCGGGAGT CGCCAATTTC TTAGATAAAG ATAAGAAAGG GGTU^AAACGC 198 0 

AAATACCGTC ATAAATCTGG TGGATATGCT ATCGGAGGTG CAGCGCAAAC TTGTTCTGAA 2 04 0 

AACTTAATTA GCTTTGCCTT TTGCCAACTC TTTGGTAGCG ATAAAGATTT CTTAGTCGCT 2100 

AAAAATCATA CTGATACCTA TGCAGGAGCC TTCTATATCC AACACATTAC AGAATGTAGT 2160 

GGGTTCATAG GTTGTCTCTT AGATAAACTT CCTGGCTCTT GGAGTCATAA ACCCCTCGTT 222 0 

TTAGAAGGGC AGCTCGCTTA TAGCCACGTC AGTAATGATC TGAAGACAAA GTATACTGCG 22 8 0 

TATCCTGAGG TGAAAGGTTC TTGGGGGAAT AATGCTTTTA ACATGATGTT GGGAGCTTCT 234 0 

TCTCATTCTT ATCCTGAATA CCTGCATTGT TTTGATACCT ATGCTCCATA CATCAAACTG 24 00 

AATCTGACCT ATATACGTCA GGACAGCTTC TCGGAGAAAG GTACAGAAGG AAGATCTTTT 24 60 

GATGACAGCA ACCTCTTCAA TTTATCTTTG CCTATAGGGG TGAAGTTTGA GAAGTTCTCT 252 0 

GATTGTAATG ACTTTTCTTA TGATCTGACT TTATCCTATG TTCCTGATCT TATCCGCAAT 258 0 

GATCCCAAAT GCACTACAGC ACTTGTi^ATC AGCGGAGCCT CTTGGGAAAC TTATGCCAAT 2 64 0 

AACTTAGCAC GACAGGCCTT GCAAGTGCGT GCAGGCAGTC ACTACGCCTT CTCTCCTATG 27 00 

TTTGAAGTGC TCGGCCAGTT TGTCTTTGAA GTTCGTGGAT CCTCACGGAT TTATAATGTA 2 7 60 

GATCTTGGGG GTAAGTTCCA ATTCTAGGAG CGTCTCTCAT GTCTCAGAAA TTCTG 2 815 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 
15 10 15 
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Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 
20 25 30 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 
35 40 45 

Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu Thr Gly Asp lie Thr 
50 55 60 

Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 

Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 
85 90 95 

Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 
100 105 110 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 
115 120 125 

Ala Ala Pro Ser Ser Val Ile'Thr'Thr Pro Ser Gly Lys Gly Ala Val 
130 135 140 

Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala lie Ser Thr Lys Asn 
165 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser Phe Glu Gly Asn Lys 
180 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie Cys Ala Thr Gly Thr 
195 200 205 

Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn lie 
210 215 220 

Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie 
225 230 235 240 

Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr 
245 250 255 

Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala Asp Val Thr lie Ser 
260 265 270 

Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly 
275 280 285 

Gly Ala lie Tyr Ala Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly 
290 295 300 



Gly lie Ser Phe Ser Asn Asn lie Val Gin Gly Thr Thr Ala Gly Asn 
305 310 315 320 
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Gly Gly Ala lie Ser lie Leu Ala Ala Gly Glu Cys Ser Leu Ser Ala 
325 330 335 

Glu Ala Gly Asp lie Thr Phe Asn Gly Asn Ala lie Val Ala Thr Thr 
340 345 350 

Pro Gin Thr Thr Lys Arg Asn Ser lie Asp lie Gly Ser Thr Ala Lys 
355 360 365 

lie Thr Asn Leu Arg Ala lie Ser Gly His Ser He Phe Phe Tyr Asp 
370 375 380 

Pro lie Thr Ala Asn Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu 
385 390 395 400 

Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val 
405 410 415 

Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn 
420 425 430 

Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu 
435 ' 440 445 

Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr 
450 455 460 

Ala Gly Ser Ser Val He Met Asp Ala Gly Thr Thr Leu Lys Ala Ser 
465 470 475 480 

Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He Pro Val Asp Ser Leu 
485 490 ' 495 

Gly Glu Gly Lys Lys Val Val He Ala Ala Ser Ala Ala Ser Lys Asn 
500 505 510 

Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp Asn Gin Gly Asn Ala 
515 520 525 

Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin 
530 535 540 

Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro 
545 550 555 560 

Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met 
565 570 575 

Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr 
580 . 585 590 

Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly 
595 600 605 



Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp He Gin Ala 
610 615 620 
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lie Gin Gly Val lie Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg 
625 630 635 640 

Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys 
645 650 655 

Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly 
660 665 670 

Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys 
675 680 685 

Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr 
690 695 700 

Asp Thr Tyr Ala Gly Ala Phe Tyr lie 'Gin His lie Thr Glu Cys Ser 
705 710 715 720 

Gly Phe lie Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His 
725 730 735 

Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn 
740 745 750 

Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp 
755 760 765 

Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr 
770 775 - 780 

Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu 
785 790 795 800 

Asn Leu Thr Tyr lie Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu 
805 810 815 

Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie 
820 825 830 

Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp 
835 840 845 

Leu Thr Leu Ser Tyr Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys 
850 855 860 

Thr Thr Ala Leu Val lie Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala 
885 890 895 

Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg 
900 905 910 

Gly Ser Ser Arg lie Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 



(2) INFORMATION FOR SEQ ID NO : 5 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



ATGCGATTTT CGCTCTGCGG ATTTCCTCTA GTTTTTTCTT TAACATTGCT CTCAGTCTTC 60 

GACACTTCTT TGAGTGCTAC TACGATTTCT TTAACCCCAG AAGATAGTTT TCATGGAGAT 12 0 

AGTCAGAATG CAGAACGTTC TTATAATGTT CAAGCTGGGG ATGTCTATAG CCTTACTGGT 18 0 

GATGTCTCAA TATCTAACGT CGATAACTCT GCATTAAATA AAGCCTGCTT CAATGTGACC 24 0 

TCAGGAAGTG TGACGTTCGC AGGAAATCAT CATGGGTTAT ATTTTAATAA TATTTCCTCA 3 00 

GGAACTACAA AGGAAGGGGC TGTACTTTGT TGCCAAGATC CTCAAGCAAC GGCACGTTTT 3 60 

TCTGGGTTCT CCACGCTCTC TTTTATTCAG AGCCCCGGAG ATATTAAAGA ACAGGGATGT 4 20 

CTCTATTCAA AAAATGCACT TATGCTCTTA AACAATTATG TAGTGCGTTT TGAACAAAAC 4 80 

CAAAGTAAGA CTAAAGGCGG AGCTATTAGT GGGGCGAATG TTACTATAGT AGGCAACTAC 54 0 

GATTCCGTCT CTTTCTATCA GAATGCAGCC ACTTTTGGAG GTGCTATCCA TTCTTCAGGT 6 00 

CCCCTACAGA TTGCAGTAAA TCAGGCAGAG ATAAGATTTG CACAAAATAC TGCCAAGAAT 6 60 

GGTTCTGGAG GGGCTTTGTA CTCCGATGGT GATATTGATA TTGATCAGAA TGCTTATGTT "7 2 0 

CTATTTCGAG AAAATGAGGC ATTGACTACT GCTATAGGTA AGGGAGGGGC TGTCTGTTGT 7 80 

CTTCCCACTT CAGGAAGTAG TACTCCAGTT CCTATTGTGA CTTTCTCTGA CAATAAACAG 8 40 

TTAGTCTTTG AAAGAAACCA TTCCATAATG GGTGGCGGAG CCATTTATGC TAGGAT^ACTT 9 00 

AGCATCTCTT CAGGAGGTCC TACTCTATTT ATCAATAATA TATCATATGC AAATTCGCAA 960 

AATTTAGGTG GAGCTATTGC CATTGATACT GGAGGGGAGA TCAGTTTATC AGCAGAGAAA 102 0 

GGAACAATTA CATTCCAAGG AAACCGGACG AGCTTACCGT TTTTGAATGG CATCCATCTT 10 8 0 

TTACAAAATG CTAAATTCCT GAAATTACAG GCGAGAAATG GATGCTCTAT AGAATTTTAT 114 0 

GATCCTATTA CTTCTGAAGC AGATGGGTCT ACCCAATTGA ATATCAACGG AGATCCTAAA 12 00 

AATAAAGAGT ACACAGGGAC CATACTCTTT TCTGGAGAAA AGAGTCTAGC AAACGATCCT 12 60 

AGGGATTTTA AATCTACAAT CCCTCAGAAC GTCAACCTGT CTGCAGGATA CTTAGTTATT 132 0 

AAAGAGGGGG CCGAAGTCAC AGTTTCAAAA TTCACGCAGT CTCCAGGATC GCATTTAGTT 13 8 0 

TTAGATTTAG GAACCAAACT GATAGCCTCT AAGGAAGACA TTGCCATCAC AGGCCTCGCG 14 4 0 

ATAGATATAG ATAGCTTAAG CTCATCCTCA ACAGCAGCTG TTATTAAAGC AAACACCGCA 150 0 

AATAAACAGA TATCCGTGAC GGACTCTATA GAACTTATCT CGCCTACTGG CAATGCCTAT 1560 

GAAGATCTCA GAATGAGAAA TTCACAGACG TTCCCTCTGC TCTCTTTAGA GCCTGGAGCC 162 0 

GGGGGTAGTG TGACTGTAAC TGCTGGAGAT TTCCTACCGG TAAGTCCCCA TTATGGTTTT 168 0 

CAAGGCAATT GGAAATTAGC TTGGACAGGA ACTGGAAACA AAGTTGGAGA ATTCTTCTGG 17 4 0 

GATAAAATAA ATTATAAGCC TAGACCTGAA AAAGAAGGAA ATTTAGTTCC TAATATCTTG 18 00 

TGGGGGAATG CTGTAAATGT CAGATCCTTA ATGCAGGTTC AAGAGACCCA TGCATCGAGC 18 60 

TTACAGACAG ATCGAGGGCT GTGGATCGAT GGAATTGGGA ATTTCTTCCA TGTATCTGCC 192 0 

TCCGAAGACA ATATAAGGTA CCGTCATAAC AGCGGTGGAT ATGTTCTATC TGTAAATAAT 19 8 0 

GAGATCACAC CTAAGCACTA TACTTCGATG GCATTTTCCC AACTCTTTAG TAGAGACAAG 2 04 0 

GACTATGCGG TTTCCAACAA CGAATACAGA ATGTATTTAG GATCGTATCT CTATCAATAT 2100 

ACAACCTCCC TAGGGAATAT TTTCCGTTAT GCTTCGCGTA ACCCTAATGT AAACGTCGGG 2160 

ATTCTCTCAA GAAGGTTTCT TCAAAATCCT CTTATGATTT TTCATTTTTT GTGTGCTTAT 222 0 

GGTCATGCCA CCAATGATAT GAAAACAGAC TACGCAAATT TCCCTATGGT GAAAAACAGC 22 8 0 

TGGAGAAACA ATTGTTGGGC TATAGAGTGC GGAGGGAGCA TGCCTCTATT GGTATTTGAG 234 0 

AACGGAAGAC TTTTCCAAGG TGCCATCCCA TTTATGAAAC TACAATTAGT TTATGCTTAT 24 00 

CAGGGAGATT TCAAAGAGAC GACTGCAGAT GGCCGTAGAT TTAGTAATGG GAGTTTAACA 24 60 

TCGATTTCTG TACCTCTAGG CATACGCTTT GAGAAGCTGG CACTTTCTCA GGATGTACTC 252 0 

TATGACTTTA GTTTCTCCTA TATTCCTGAT ATTTTCCGTA AGGATCCCTC ATGTGAAGCT 258 0 

GCTCTGGTGA TTAGCGGAGA CTCCTGGCTT GTTCCGGCAG CACACGTATC AAGACATGCT 2 64 0 

TTTGTAGGGA GTGGAACGGG TCGGTATCAC TTTAACGACT ATACTGAGCT CTTATGTCGA 2700 

GGAAGTATAG AATGCCGCCC CCATGCTAGG AATTATAATA TAAACTGTGG AAGCAAATTT 27 60 

CGTTTTTAGA AGGTTTCCAT TGCCTGTGTG GTTCCGGATC TTAACTATAA ATCCTGGACT 2 820 
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ATGGATCATA GGCATteCGT TTCTCGAACT TGTGTGGAGA ATAACGACAT TTTATATGCA 28 8 0 

TAACGGAATA CTCGTA^CAC CTCAGCCCCT AGAGACATTC TTTAGGGGTT CTTTATTTGT 2 94 0 

CTAAACTTCG TATTTTATCG AGAATCCTTT ACGTTCTTGG TTTGCTTGTC TCCGAGGAGT 3000 

TCTCTAACGA ATCATAGGfeA TTCCAGGGTT CTGTTCCTTG AGTCCTTTGG A 3051 

(2) INFORMATION FOR \eQ ID NO : 6 : 

(i) SEQUENCE CHAfeACTERISTICS : 

(A) LENGTH: 9^2 amino acids 

(B) TYPE: amirio acid 

(C) STRANDEDNEa^S : single 

(D) TOPOLOGY: iVnear 

(ii) MOLECULE TYPE: p^Dtide 

(xi) SEQUENCE DESCRIPTrON: SEQ ID NO : 6 : 

Met Arg Phe Ser Leu Cys Gly Ph^ Pro Leu Val Phe Ser Leu Thr Leu 

1 5 \ 10 15 , 

Leu Ser Val Phe Asp Thr Ser Leu Seer Ala Thr Thr lie Ser Leu Thr 
20 2\ 30 

Pro Glu Asp Ser Phe His Gly Asp Ser Vln Asn Ala Glu Arg Ser Tyr 
35 40 \ , 45 

Asn Val Gin Ala Gly Asp Val Tyr Ser Le\ Thr Gly Asp Val Ser lie 
50 55 \ 60 

Ser Asn Val Asp Asn Ser Ala Leu Asn Lys AJsa Cys Phe Asn Val Thr 
65 70 7^ 80 

Ser Gly Ser Val Thr Phe Ala Gly Asn His His yly Leu Tyr Phe Asn 
85 90 \ 95 

Asn lie Ser Ser Gly Thr Thr Lys Glu Gly Ala VaV Leu Cys Cys Gin 
100 105 >y 110 

Asp Pro Gin Ala Thr Ala Arg Phe Ser Gly Phe SerVThr Leu Ser Phe 
115 120 ^25 

lie Gin Ser Pro Gly Asp lie Lys Glu Gin Gly Cys Lteu Tyr Ser Lys 
130 135 140 \ 

Asn Ala Leu Met Leu Leu Asn Asn Tyr Val Val Arg Phfe Glu Gin Asn 
145 150 155 . \ 160 

Gin Ser Lys Thr Lys Gly Gly Ala lie Ser Gly Ala Asn Val Thr lie 
165 170 \ 175 

Val Gly Asn Tyr Asp Ser Val Ser Phe Tyr Gin Asn Ala Alav Thr Phe 
180 185 190\ 

Gly Gly Ala lie His Ser Ser Gly Pro Leu Gin lie Ala Val A^n Gin 
195 200 205 \ 



Ala Glu lie Arg Phe Ala Gin Asn Thr Ala Lys Asn Gly Ser Gly \^ly 
210 215 220 



50 



Ala Leu Tyr Ser Asp Gly Asp lie Asp lie Asp Gin Asn Ala Tyr Val 
225 230 235 240 

Leu Phe Arg Glu Asn Glu Ala Leu Thr Thr Ala lie Gly Lys Gly Gly 
245 250 255 

Ala Val Cys Cys Leu Pro Thr Ser Gly Ser- Ser Thr Pro Val Pro lie 
260 265 270 

Val Thr Phe Ser Asp Asn Lys Gin Leu Val Phe Glu Arg Asn His Ser 
275 280 285 

lie Met Gly Gly Gly Ala lie Tyr Ala Arg Lys Leu Ser lie Ser Ser 

■ 290 295 300 

Gly Gly Pro Thr Leu Phe lie Asn Asn lie Ser Tyr Ala Asn Ser Gin 

305 310 315 320 

Asn Leu Gly Gly Ala lie Ala lie Asp Thr Gly Gly Glu lie Ser Leu 
325 330 335 

. Ser Ala Glu Lys Gly Thr lie Thr Phe Gin Gly Asn Arg Thr Ser Leu 
340 345 350 

Pro Phe Leu Asn Gly lie His Leu Leu Gin Asn Ala Lys Phe Leu Lys 
355 360 365 

Leu Gin Ala Arg Asn Gly Cys Ser lie Glu Phe Tyr Asp Pro lie Thr 
370 375 380 

Ser Glu Ala Asp Gly Ser Thr Gin Leu Asn lie Asn Gly Asp Pro Lys 
385 390 395 400 

Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu Lys Ser Leu 
405 410 415 

Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr lie Pro Gin Asn Val Asn 
420 425 430 

Leu Ser Ala Gly Tyr Leu Val lie Lys Glu Gly Ala Glu Val Thr Val 
435 440 445 

Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu Asp Leu Gly 
450 455 460 

Thr Lys Leu lie Ala Ser Lys Glu Asp lie Ala lie Thr Gly Leu Ala 
465 470 475 480 

lie Asp lie Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala Val lie Lys 
485 490 495 

Ala Asn Thr Ala Asn Lys Gin lie Ser Val Thr Asp Ser lie Glu Leu 
500 505 510 



lie Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met Arg Asn Ser 
515 520 525 
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Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly Gly Ser Val 
530 535 540 

Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His Tyr Gly Phe 
545 550 555 560 

Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn Lys Val Gly 
565 570 575 

Glu Phe Phe Trp Asp Lys lie Asn Tyr Lys Pro Arg Pro Glu Lys Glu 
580 585 590 

Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val Asn Val Arg 
595 600 605 

Ser Leu Met Gin Val Gin Glu Thr His Ala Ser ^Ser Leu Gin Thr Asp 
610 615 620 

Arg Gly Leu Trp lie Asp Gly lie Gly Asn Phe Phe His Val Ser Ala 
625 630 - 635 640 

Ser Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly Tyr Val Leu 
645 650 655 

Ser Val Asn Asn Glu lie Thr Pro Lys His Tyr Thr Ser Met Ala Phe 
660 665 670 

Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser Asn Asn Glu 
675 680 685 

Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr Thr Ser Leu 
690 695 700 

Gly Asn lie Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val Asn Val Gly 
705 710 715 720 

lie Leu Ser Arg Arg Phe Leu Gin Asn Pro Leu Met lie Phe His Phe 
725 730 735 

Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr Asp Tyr Ala 
740 745 750 

Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys Trp Ala lie 
755 760 765 

Glu Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn Gly Arg Leu 
770 775 780 

Phe Gin Gly Ala lie Pro Phe Met Lys Leu Gin Leu Val Tyr Ala Tyr 
785 790 795 800 

Gin Gly Asp Phe Lys Glu Thr Thr Ala Asp Gly Arg Arg Phe Ser Asn 
805 810 815 

Gly Ser Leu Thr Ser lie Ser Val Pro Leu Gly lie Arg Phe Glu Lys 
820 825 830 



Leu Ala Leu Ser Gin Asp Val Leu Tyr Asp Phe Ser Phe Ser Tyr lie 



Pro Asp lie Phe Arg Lys Asp Pro Ser Cys Glu Ala Ala Leu Val lie 
850 855 860 

Ser Gly Asp Ser Trp Leu Val Pro Ala Ala His Val Ser Arg His Ala 
865 870 875 880 

Phe Val Gly Ser Gly Thr Gly Arg Tyr His Phe Asn Asp Tyr Thr Glu 
885 890 895 

Leu Leu Cys Arg Gly Ser lie Glu Cys Arg Pro His Ala Arg Asn Tyr 
900 905 910 

Asn lie Asn Cys Gly Ser Lys Phe Arg Phe 
915 920 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGAAGATTC CACTCCGCTT TTTATTGATA TCATTAGTAC CTACGCTTTC TATGTCGAAT 60 

TTATTAGGAG CTGCTACTAC CGAAGAGCTA TCGGCTAGCA ATAGCTTCGA TGGAACTACA 12 0 

TCAACAACAA GCTTTTCTAG TAAAACATCA TCGGCTACAG ATGGCACCAA TTATGTTTTT 180 

AAAGATTCTG TAGTTATAGA AAATGTACCC AAAACAGGGG AAACTCAGTC TACTAGTTGT 2 40 

TTTAAAAATG ACGCTGCAGC TGGAGATCTA AATTTCTTAG GAGGGGGATT TTCTTTCACA 3 00 

TTTAGCAATA TCGATGCAAC CACGGCTTCT GGAGCTGCTA TTGGAAGTGA AGCAGCTAAT 3 60 

AAGACAGTCA CGTTATCAGG ATTTTCGGCA CTTTCTTTTC TTAAATCCCC AGCAAGTACA 4 20 

GTGACTAATG GATTGGGAGC TATCAATGTT AAAGGGAATT TAAGCCTATT GGATAATGAT 4 80 

AAGGTATTGA TTCAGGACAA TTTCTCAACA GGAGATGGCG GAGCAATTAA TTGTGCAGGC 54 0 

TCCTTGAAGA TCGCAAACAA TAAGTCCCTT TCTTTTATTG GAAATAGTTC TTCAACACGT 600 

GGCGGAGCGA TTCATACCAA AAACCTCACA CTATCTTCTG GTGGGGAAAC TCTATTTCAG 660 

GGGAATACAG CGCCTACGGC TGCTGGTAAA GGAGGTGCTA TCGCGATTGC AGACTCTGGC 720 

ACCCTATCCA TTTCTGGAGA CAGTGGCGAC ATTATCTTTG AAGGCAATAC GATAGGAGCT 780 

ACAGGAACCG TCTCTCATAG TGCTATTGAT TTAGGAACTA GCGCTAAGAT AACTGCGTTA 84 0 

CGTGCTGCGC AAGGACATAC GATATACTTT TATGATCCGA TTACTGTAAC AGGATCGACA 900 

TCTGTTGCTG ATGCTCTCAA TATTAATAGC CCTGATACTG GAGATAACAA AGAGTATACG 960 

GGAACCATAG TCTTTTCTGG AGAGAAGCTC ACGGAGGCAG AAGCTAAAGA TGAGAAGAAC 102 0 

CGCACTTCTA AATTACTTCA AAATGTTGCT TTTAAAAATG GGACTGTAGT TTTAAAAGGT 108 0 

GATGTCGTTT TAAGTGCGAA CGGTTTCTCT CAGGATGCAA ACTCTAAGTT GATTATGGAT 114 0 

TTAGGGACGT CGTTGGTTGC AAACACCGAA AGTATCGAGT TAACGAATTT GGAAATTAAT 12 00 

ATAGACTCTC TCAGGAACGG GAAAAAGATA AAACTCAGTG CTGCCACAGC TCAGAAAGAT 12 60 

ATTCGTATAG ATCGTCCTGT TGTACTGGCA ATTAGCGATG AGAGTTTTTA TCAAAATGGC 132 0 

TTTTTGAATG AGGACCATTC CTATGATGGG ATTCTTGAGT TAGATGCTGG GAAAGACATC 1380 

GTGATTTCTG CAGATTCTCG CAGTATAAAT GCTGTACAAT CTCCGTATGG CTATCAGGGA 14 4 0 

AAGTGGACAA TCAATTGGTC TACTGATGAT AAGAAAGCTA CGGTTTCTTG GGCAAAGCAA 1500 

AGTTTTAATC CCACTGCTGA GCAGGAGGCT CCGTTAGTTC CTAATCTTCT TTGGGGTTCT 15 60 

TTTATAGATG TTCGTCCCTT CCAAPJKTTTT ATAGAGCTAG GTACTGAAGG TGCTCCTTAC 1620 

GAAAAGAGAT TTTGGGTTGC AGGCATTTCC AATGTTTTGC ATAGGAGCGG TCGTGAAAAT 1680 
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CAAAGGAAAT TCCGTCATGT GAGTGGAGGT GCTGTAGTAG GTGCTAGCAC GAGGATGCCG 17 4 0 

GGTGGTGATA CCTTGTCTCT GGGTTTTGCT CAGCTCTTTG CGCGTGACAA AGACTACTTT 1800 

ATGAATACCA ATTTCGCAAA GACCTACGCA GGATCTTTAC GTTTGCAGCA CGATGCTTCC 1860 

CTATACTCTG TGGTGAGTAT CCTTTTAGGA GAGGGAGGAC TCCGCGAGAT CCTGTTGCCT 192 0 

TATGTTTCCA AGACTCTGCC GTGCTCTTTC TATGGGCAGC TTAGCTACGG CCATACGGAT 198 0 

CATCGCATGA AGACCGAGTC TCTACCCCCC CCCCCCCCGA CGCTCTCGAC GGATCATACT 2 04 0 

TCTTGGGGAG GATATGTCTG GGCTGGAGAG CTGGGAACTC GAGTTGCTGT TGAAAATACC 2100 

AGCGGCAGAG GATTTTTCCG AGAGTACACT CCATTTGTAA AAGTCCAAGC TGTTTACTCG 2160 

CGCCAAGATA GCTTTGTTGA ACTAGGAGCT ATCAGTCGTG ATTTTAGTGA TTCGCATCTT 222 0 

TATAACCTTG CGATTCCTCT TGGAATCAAG TTAGAGAAAC GGTTTGCAGA GCAATATTAT 22 8 0 

CATGTTGTAG CGATGTATTC TCCAGATGTT TGTCGTAGTA ACCCCAAATG TACGACTACC 234 0 

CTACTTTCCA ACCAAGGGAG TTGGAAGACC AAAGGTTCGA ACTTAGCAAG ACAGGCTGGT 24 00 

ATTGTTCAGG CCTCAGGTTT TCGATCTTTG GGAGCTGCAG CAGAGCTTTT CGGGAACTTT 2 4 60 

GGCTTTGAAT GGCGGGGATC TTCTCGTAGC TATAATGTAG ATGCGGGTAG CAAAATCAAA 252 0 

TTTTAG 252 6 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys lie Pro Leu Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu 
15 10 15 

Ser Met Ser Asn Leu Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala 
20 25 30 

Ser Asn Ser Phe Asp Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys 
35 40 45 

Thr Ser Ser Ala Thr Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val 
50 55 60 

Val lie Glu Asn Val Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys 
65 70 75 80 

Phe Lys Asn Asp Ala Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly 
85 90 95 

Phe Ser Phe Thr Phe Ser Asn lie Asp Ala Thr Thr Ala Ser Gly Ala 
100 105 110 

Ala lie Gly Ser Glu Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe 
115 120 125 

Ser Ala Leu Ser Phe Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly 
130 135 140 



Leu Gly Ala lie Asn Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp 
145 150 155 160 
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Lys Val Leu lie Gin Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala lie 
165 170 175 

Asn Cys Ala Gly Ser Leu Lys lie Ala Asn Asn Lys Ser Leu Ser Phe 
180 185 190 

lie Gly Asn Ser Ser Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn 
195 200 205 

Leu Thr Leu Ser Ser Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala 
210 215 220 

Pro Thr Ala Ala Gly Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly 
225 230 235 240 

Thr Leu Ser lie Ser Gly Asp Ser Gly Asp lie lie Phe Glu Gly Asn 
245 250 255 

Thr lie Gly Ala Thr Gly Thr Val Ser His Ser Ala lie Asp Leu Gly 
260 265 270 

Thr Ser Ala Lys lie Thr Ala Leu Arg Ala Ala Gin Gly His Thr lie 
275 280 285 

Tyr Phe Tyr Asp Pro lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
290 295 300 

Ala Leu Asn lie Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
305 310 315 320 

Gly Thr lie Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
325 330 335 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
340 345 350 

Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly 
355 360 365 

Phe Ser Gin Asp Ala Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser 
370 375 380 

Leu Val Ala Asn Thr Glu Ser lie Glu Leu Thr Asn Leu Glu lie Asn 
385 390 395 400 

lie Asp Ser Leu Arg Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr 
405 410 415 

Ala Gin Lys Asp lie Arg lie Asp Arg Pro Val Val Leu Ala lie Ser 
420 425 430 

Asp Glu Ser Phe Tyr Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr 
435 440 445 



Asp Gly lie Leu Glu Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala 
450 455 460 
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Asp Ser Arg Ser lie Asn Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly 
465 470 475 480 

Lys Trp Thr lie Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser 
485 490 495 

Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 
500 505 510 

Val Pro Asn Leu Leu Trp Gly Ser Phe lie Asp Val Arg Pro Phe Gin 
515 520 525 

Asn Phe lie Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
530 535 540 

Trp Val Ala Gly lie Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
545 550 555 560 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
565 570 575 

Thr Arg Met Pro Gly Gly Asp .Thr Leu Ser Leu Gly Phe Ala Gin Leu 
580 585 590 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
595 600 605 

Tyr Ala Gly Ser Leu Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val 
610 615 620 

Val Ser lie Leu Leu Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro 
625 630 635 640 

Tyr Val Ser Lys Thr Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr 
645 650 655 

Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro 
660 665 670 

Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala 
675 680 685 

Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly 
690 695 700 

Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ser 
705 710 715 720 

Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
725 730 735 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
740 745 750 



Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
755 760 765 
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Asp Val Cys Arg Ser Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn 
770 775 780 

Gin Gly Ser Trp Lys Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly 



Phe Gly Asn Phe Gly Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn 
820 825 830 

Val Asp Ala Gly Ser Lys lie Lys Phe 
835 840 

(2) .INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGAAGTCTT CTTTCCCCAA GTTTGTATTT TCTACATTTG CTATTTTCCC TTTGTCTATG 60 

ATTGCTACCG AGACAGTTTT GGATTCAAGT GCGAGTTTCG ATGGGAATAA AAATGGTAAT 12 0 

TTTTCAGTTC GTGAGAGTCA GGAAGATGCT GGAACTACCT ACCTATTTAA GGGAAATGTC 180 

ACTCTAGAAA ATATTCCTGG AACAGGCACA GCAATCACAA 7\AAGCTGTTT TAACAACACT 240 

AAGGGCGATT TGACTTTCAC AGGTAACGGG AACTCTCTAT TGTTCCAAAC GGTGGATGCA 3 00 

GGGACTGTAG CAGGGGCTGC TGTTAACAGC AGCGTGGTAG ATAAATCTAC CACGTTTATA 3 60 

GGGTTTTCTT CGCTATCTTT TATTGCGTCT CCTGGAAGTT CGATAACTAC CGGCAAAGGA 42 0 

GCCGTTAGCT GCTCTACGGG TAGCTTGAAG TTTGACAAAA ATGTCAGTTT GCTCTTCAGC 4 80 

AAAAACTTTT CAACGGATAA TGGCGGTGCT ATCACCGCAA AAACTCTTTC ATTAACAGGG 54 0 

ACTACAATGT CAGCTCTGTT TTCTGAAAAT ACCTCCTCAA AGAAAGGCGG AGCCATTCAG 600 

ACTTCCGATG CCCTTACCAT TACTGGAAAC CAAGGGGAAG TCTCTTTTTC TGACAATACT 660 

TCTTCGGATT CTGGAGCTGC AATTTTTACA GAAGCCTCGG TGACTATTTC TAATAATGCT 720 

AAAGTTTCCT TTATTGACAA TAAGGTCACA GGAGCGAGCT CCTCAACAAC GGGGGATATG 780 

TCAGGAGGTG CTATCTGTGC TTATAAAACT AGTACAGATA CTAAGGTCAC CCTCACTGGA 84 0 

AATCAGATGT TACTCTTCAG CAACAATACA TCGACAACAG CGGGAGGAGC TATCTATGTG 900 

AAAAAGCTCG AACTGGCTTC CGGAGGACTT ACCCTATTCA GTAGAAATAG TGTCAATGGA 960 

GGTACAGCTC CTAAAGGTGG AGCCATAGCT ATCGAAGATA GTGGGGAATT GAGTTTATCC 102 0 

GCCGATAGTG GTGACATTGT CTTTTTAGGG AATACAGTCA CTTCTACTAC TCCTGGGACG 108 0 

AATAGAAGTA GTATCGACTT AGGAACGAGT GCAAAGATGA CAGCTTTGCG TTCTGCTGCT 114 0 

GGTAGAGCCA TCTACTTCTA TGATCCCATA ACTACAGGAT CTTCCACAAC AGTTACAGAT 12 00 

GTCTTAAAAG TTAATGAGAC TCCGGCAGAT TCTGCACTAC AATATACAGG GAACATCATC 12 60 

TTCACAGGAG AAAAGTTATC AGAGACAGAG GCCGCAGATT CTAAAAATCT TACTTCGAAG 1320 

CTACTACAGC CTGTAACTCT TTCAGGAGGT ACTCTATCTT TAAAACATGG AGTGACTCTG 1380 

CAGACTCAGG CATTCACTCA ACAGGCAGAT TCTCGTCTCG AAATGGACGT AGGAACTACT 14 4 0 

CTAGAACCTG CTGATACTAG CACCATAAAC AATTTGGTCA TTAACATCAG TTCTATAGAC 1500 

GGTGCAAAGA AGGCAAAAAT AGAAACCAAA GCTACGTCAA AAPATCTGAC TTTATCTGGA 15 60 

ACCATCACTT TATTGGACCC GACGGGCACG TTTTATGAAA ATCATAGTTT AAGAAATCCT 1620 

CAGTCCTACG ACATCTTAGA GCTCAAAGCT TCTGGAACTG TAACAAGCAC CGCAGTGACT 1680 

CCAGATCCTA TAATGGGTGA GAAATTCCAT TACGGCTATC AGGGAACTTG GGGCCCAATT 17 4 0 

GTTTGGGGGA CAGGGGCTTC TACGACTGCA ACCTTCAACT GGACTAAAAC TGGCTATATT 18 00 

CCTAATCCCG AGCGTATCGG CTCTTTAGTC CCTAATAGCT TATGGAATGC ATTTATAGAT 18 60 
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ATTAGCTCTC TCCATTATCT TATGGAGACT GCAAACGAAG GGTTGCAGGG AGACCGTGCT 192 0 

TTTTGGTGTG CTGGATTATC TAACTTCTTC CATAAGGATA GTACAAAAAC ACGACGCGGG 19 80 

TTTCGCCATT TGAGTGGCGG TTATGTCATA GGAGGAAACC TACATACTTG TTCAGATAAG 204 0 

ATTCTTAGTG CTGCATTTTG TCAGCTCTTT GGAAGAGATA GAGACTACTT TGTAGCTAAG 2100 

AATCAAGGTA CAGTCTACGG AGGAACTCTC TATTACCAGC ACAACGAAAC CTATATCTCT 2160 

CTTCCTTGCA AACTACGGCC TTGTTCGTTG TCTTATGTTC CTACAGAGAT TCCTGTTCTC 222 0 

TTTTCAGGAA ACCTTAGCTA CACCCATACG GATAACGATC TGAAAACCAA GTATACAACA 22 80 

TATCCTACTG TTAAAGGAAG CTGGGGGAAT GATAGTTTCG CTTTAGAATT CGGTGGAAGA 234 0 

GCTCCGATTT GCTTAGATGA AAGTGCTCTA TTTGAGCAGT ACATGCCCTT CATGAAATTG 24 00 

CAGTTTGTCT ATGCACATCA GGAAGGTTTT AAAGAACAGG GAACAGAAGC TCGTGAATTT 24 60 

GGAAGTAGCC GTCTTGTGAA TCTTGCCTTA CCTATCGGGA TCCGATTTGA TAAGGAATCA 252 0 

GACTGCCAAG ATGCAACGTA CAATCTAACT CTTGGTTATA CTGTGGATCT TGTTCGTAGT 2 58 0 

AACCCCGACT GTACGACAAC ACTGCGAATT AGCGGTGATT CTTGGAAAAC CTTCGGTACG 2 64 0 

AATTTGGCAA GACAAGCTTT AGTCCTTCGT GCAGGGAACC ATTTTTGCTT TAACTCAAAT 2 700 

TTTGAAGCCT TTAGCCAATT TTCTTTTGAA TTGCGTGGGT CATCTCGCAA TTACAATGTA 2 7 60 

GACTTAGGAG CAA/^ATACCA ATTCTAA 27 8 7 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Ser Ser Phe Pro Lys Phe Val Phe Ser Thr Phe Ala lie Phe 
15 10 15 

Pro Leu Ser Met lie Ala Thr Glu Thr Val Leu Asp Ser Ser Ala Ser 
20 25 30 

Phe Asp Gly Asn Lys Asn Gly Asn Phe Ser Val Arg Glu Ser Gin Glu 
35 40 45 

Asp Ala Gly Thr Thr Tyr Leu Phe Lys Gly Asn Val Thr Leu Glu Asn 
50 55 60 

lie Pro Gly Thr Gly Thr Ala lie Thr Lys Ser Cys Phe Asn Asn Thr 
65 70 75 80 

Lys Gly Asp Leu Thr Phe Thr Gly Asn Gly Asn Ser Leu Leu Phe Gin 
85 90 95 

Thr Val Asp Ala Gly Thr Val Ala Gly Ala Ala Val Asn Ser Ser Val 
100 105 110 

Val Asp Lys Ser Thr Thr Phe lie Gly Phe Ser Ser Leu Ser Phe lie 
115 120 125 

Ala Ser Pro Gly Ser Ser lie Thr Thr Gly Lys Gly Ala Val Ser Cys 
130 135 140 

Ser Thr Gly Ser Leu Lys Phe Asp Lys Asn Val Ser Leu Leu Phe Ser 
145 150 155 160 
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Lys Asn Phe Ser Thr Asp Asn Gly Gly Ala lie Thr Ala Lys Thr Leu 
165 170 175 

Ser Leu Thr Gly Thr Thr Met Ser Ala Leu Phe Ser Glu Asn Thr Ser 
180 185 190 

Ser Lys Lys Gly Gly Ala lie Gin Thr Ser Asp Ala Leu Thr lie Thr 
195 200 205 

Gly Asn Gin Gly Glu Val Ser Phe Ser Asp Asn Thr Ser Ser Asp Ser 
210 215 220 

Gly Ala Ala lie Phe Thr Glu Ala Ser Val Thr lie Ser Asn Asn Ala 
225 230 235 240 

Lys Val Ser Phe lie Asp Asn Lys Val Thr Gly Ala Ser Ser Ser Thr 
245 250 255 

Thr Gly Asp Met Ser Gly Gly Ala lie Cys Ala Tyr Lys Thr Ser Thr 
260 265 270 

Asp Thr Lys Val Thr Leu Thr Gly Asn Gin Met Leu Leu Phe Ser Asn 
275 280 285 

Asn Thr Ser Thr Thr Ala Gly Gly Ala lie Tyr Val Lys Lys Leu Glu 
290 295 300 

Leu Ala Ser Gly Gly Leu Thr Leu Phe Ser Arg Asn Ser Val Asn Gly 
305 310 315 320 

Gly Thr Ala Pro Lys Gly Gly Ala lie Ala lie Glu Asp Ser Gly Glu 
325 330 335 

Leu Ser Leu Ser Ala Asp Ser Gly Asp lie Val Phe Leu Gly Asn Thr 
340 345 350 

Val Thr Ser Thr Thr Pro Gly Thr Asn Arg Ser Ser lie Asp Leu Gly 
355 360 . 365 

Thr Ser Ala Lys Met Thr Ala Leu Arg Ser Ala Ala Gly Arg Ala lie 
370 375 380 

Tyr Phe Tyr Asp Pro lie Thr Thr Gly Ser Ser Thr Thr Val Thr Asp 
385 390 395 400 

Val Leu Lys Val Asn Glu Thr Pro Ala Asp Ser Ala Leu Gin Tyr Thr 
405 410 415 

Gly Asn lie lie Phe Thr Gly Glu Lys Leu Ser Glu Thr Glu Ala Ala 
420 425 430 

Asp Ser Lys Ash Leu Thr Ser Lys Leu Leu Gin Pro Val Thr Leu Ser 
435 440 445 

Gly Gly Thr Leu Ser Leu Lys His Gly Val Thr Leu Gin Thr Gin Ala 
450 455 460 
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Phe Thr Gin Gin Ala Asp Ser Arg Leu Glu Met Asp Val Gly Thr Thr 
465 470 475 480 

Leu Glu Pro Ala Asp Thr Ser Thr lie Asn Asn Leu Val lie Asn lie 
485 . 490 495 

Ser Ser lie Asp Gly Ala Lys Lys Ala Lys lie Glu Thr Lys Ala Thr 
500 505 510 

Ser Lys Asn Leu Thr Leu Ser Gly Thr lie Thr Leu Leu Asp Pro Thr 
515 520 525 

Gly Thr Phe Tyr Glu Asn His Ser Leu Arg Asn Pro Gin Ser Tyr Asp 
530 535 540 

lie Leu Glu Leu Lys Ala Ser Gly Thr Val Thr Ser Thr Ala Val Thr 
545 550 555 560 

Pro Asp Pro lie Met Gly Glu Lys Phe His Tyr Gly Tyr Gin Gly Thr 
565 570 575 

Trp Gly Pro lie Val Trp Gly Thr Gly Ala Ser Thr Thr Ala Thr Phe 
580 585 590 

Asn Trp Thr Lys Thr Gly Tyr lie Pro Asn Pro Glu Arg lie Gly Ser 
595 600 605 

Leu Val Pro Asn Ser Leu Trp Asn Ala Phe* lie Asp lie Ser Ser Leu 
610 615 620 

His Tyr Leu Met Glu Thr Ala Asn Glu Gly Leu Gin Gly Asp Arg Ala 
625 630 635 640 

Phe Trp Cys Ala Gly Leu Ser Asn Phe Phe His Lys Asp Ser Thr Lys 
645 650 655 

Thr Arg Arg Gly Phe Arg His Leu Ser Gly Gly Tyr Val lie Gly Gly 
660 665 670 

Asn Leu His Thr Cys Ser Asp Lys lie Leu Ser Ala Ala Phe Cys Gin 
675 680 685 

Leu Phe Gly Arg Asp Arg Asp Tyr Phe Val Ala Lys Asn Gin Gly Thr 
690 695 700 

Val Tyr Gly Gly Thr Leu Tyr Tyr Gin His Asn Glu Thr Tyr lie Ser 
705 710 715 720 

Leu Pro Cys Lys Leu Arg Pro Cys Ser Leu Ser Tyr Val Pro Thr Glu 
725 730 735 

lie Pro Val Leu Phe Ser Gly Asn Leu Ser Tyr Thr His Thr Asp Asn 
740 745 750 



Asp Leu Lys Thr Lys Tyr Thr Thr Tyr Pro Thr Val Lys Gly Ser Trp 
755 760 765 
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Gly Asn Asp Ser Phe Ala Leu Glu Phe Gly Gly Arg Ala Pro lie Cys 
770 775 780 

Leu Asp Glu Ser Ala Leu Phe Glu Gin Tyr Met Pro Phe Met Lys Leu 
785 790 795 800 

Gin Phe Val Tyr Ala His Gin Glu Gly Phe Lys Glu Gin Gly Thr Glu 
805 810 815 

Ala Arg Glu Phe Gly Ser Ser Arg Leu Val Asn Leu Ala Leu Pro lie 
820 825 830 

Gly lie Arg Phe Asp Lys Glu Ser Asp Cys Gin Asp Ala Thr Tyr Asn 
835 840 845 

Leu Thr Leu Gly Tyr Thr Val Asp Leu Val Arg Ser Asn Pro Asp Cys 
850 855 860 

Thr Thr Thr Leu Arg lie Ser Gly Asp Ser Trp Lys Thr Phe Gly Thr 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Val Leu Arg Ala Gly Asn His Phe Cys 
885 890 895 

Phe Asn Ser Asn Phe Glu Ala Phe Ser Gin Phe Ser Phe Glu Leu Arg 
900 905 910 

Gly Ser Ser Arg Asn Tyr Asn Val Asp Leu Gly Ala Lys Tyr Gin Phe 
915 920 925 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGAGATCGT CTTTTTCCTT GTTATTAATA TCTTCATCTC TAGCCTTTCC TCTCTTAATG 60 

AGTGTTTCTG CAGATGCTGC CGATCTCACA TTAGGGAGTC GTGACAGTTA TAATGGTGAT 120 

ACAAGCACCA CAGAATTTAC TCCTAAAGCG GCAACTTCTG ATGCTAGTGG CACGACCTAT 180 

ATTCTCGATG GGGATGTCTC GATAAGCCAA GCAGGGAAAC AAACGAGCTT AACCACAAGT 24 0 

TGTTTTTCTA ACACTGCAGG AAATCTTACC TTCTTAGGGA ACGGATTTTC TCTTCATTTT 300 

GACAATATTA TTTCGTCTAC TGTTGCAGGT GTTGTTGTTA GCAATACAGC AGCTTCTGGG 360 

ATTACGAAAT TCTCAGGATT TTCAACTCTT CGGATGCTTG CAGCTCCTAG GACCACAGGT 42 0 

AAAGGAGCCA TTAAAATTAC CGATGGTCTG GTGTTTGAGA GTATAGGGAA TCTTGACCAA 480 

AATGTW^TG CCTCTAGTGA AAATGGGGGA GCCATCAATA CGAAGACTTT GTCTTTGACT 54 0 

GGGAGTACGC GGTTTGTAGC GTTCCTTGGC AATAGCTCGT CGCAACAAGG GGGAGCGATC 600 

TATGCTTCTG GTGACTCTGT GATTTCTGAG AATGCAGGAA TCTTGAGCTT CGGAAACAAC 660 

AGTGCGACAA CATCAGGAGG CGCGATCTCT GCTGAAGGGA ACCTTGTGAT CTCCAATAAC 72 0 

CAAAATATCT TTTTCGATGG CTGCAAAGCA ACTACAAATG GCGGAGCTAT TGATTGTAAC 780 

AAAGCAGGGG CGAACCCAGA CCCTATCTTG ACTCTTTCAG GAAATGAGAG CCTGCATTTT 84 0 

CTGAATAACA CAGCAGGAAA TAGTGGAGGT GCGATTTATA CCAAAAAATT GGTGTTATCC 900 

TCAGGACGAG GAGGAGTGTT ATTTTCTAAC AACAAAGCTG CGAATGCTAC TCCTAAAGGA 960 
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GGGGCAATTG CGATTCTAGA TTCTGGAGAG ATTAGCATTT CTGCAGATCT CGGCAATATC 102 0 

ATTTTCGAGG GCAATACTAC GAGCACTACA GGAAGTCCTG CGAGTGTGAC CAGAAATGCT 108 0 

ATAGATCTTG CATCGAATGC AAAATTTTTA T^TCTCCGAG CGACTCGGGG AAATAAAGTT 114 0 

ATTTTCTATG ATCCTATCAC GAGCTCAGGA GCTACTGATA AGCTCTCTTT GAATAAAGCT 1200 

GACGCAGGAT CTGGAAATAC CTATGAAGGC TACATCGTTT TCTCTGGAGA GAAACTCTCA 12 60 

GAAGAGGAAC TTAAGAAACC TGACAATCTG AAGTCTACAT TTACACAGGC TGTAGAGCTT 132 0 

GCTGCAGGTG CCTTAGTATT GAAAGATGGA GTGACTGTAG TTGCAAATAC TATJ^ACGCAG 138 0 

GTCGAGGGAT CGAAAGTCGT TATGGATGGA GGGACTACTT TTGAGGCAAG CGCTGAGGGG 14 4 0 

GTCACTCTCA ATGGCCTAGC CATTAATATA GATTCCTTAG ATGGGACAAA TAAAGCTATC 1500 

ATTAAGGCGA CGGCAGCAAG TAAGGATGTT GCCTTATCAG GGCCTATCAT GCTTGTAGAT 1560 

GCTCAGGGGA ACTATTATGA GCATCATAAT CTCAGTCAAC AGCAGGTCTT TCCTTTAATA 1620 

GAGCTTTCTG CACAAGGAAC GATGACTACT ACAGATATCC CCGATACCCC AATTCTAAAT 168 0 

ACTACGAATC ACTATGGGTA TCAAGGAACT GGAATAATTG TTTGGGTCGA CGATGCAACT 17 4 0 

GCAAAAACAA AAAATGCTAC CTTAACTTGG ACTAAAACAG GATACAAGCC GAATCCAGAA 18 00 

CGTCAGGGAC' CTTTGGTTCC TAATAGCCTG TGGGGTTCTT TTGTCGATGT CCGCTCCATT 18 60 

CAGAGCCTCA TGGACCGGAG CACAAGTTCG TTATCTTCGT CAACAAATTT GTGGGTATCA 192 0 

GGAATCGCGG ACTTTTTGCA TGAAGATCAG AAAGGAAACC AACGTAGTTA TCGTCATTCT 1980 

AGCGCGGGTT ATGCATTAGG AGGAGGATTC TTCACGGCTT CTGAAAATTT CTTTAATTTT 204 0 

GCTTTTTGTC AGCTTTTTGG CTACGACAAG GACCATCTTG TGGCTAAGAA CCATACCCAT 2100 

GTATATGCAG GGGCAATGAG TTACCGACAC CTCGGAGAGT CTAAGACCCT CGCTAAGATT 2160 

TTGTCAGGAA ATTCTGACTC CCTACCTTTT GTCTTCAATG CTCGGTTTGC TTATGGCCAT 2220 

ACCGACAATA ACATGACCAC AAAGTACACT GGCTATTCTC CTGTTAAGGG AAGCTGGGGA 22 8 0 

AATGATGCCT TCGGTATAGA ATGTGGAGGA GCTATCCCGG TAGTTGCTTC AGGACGTCGG 234 0 

TCTTGGGTGG ATACCCACAC GCCATTTCTA AACCTAGAGA TGATCTATGC ACATCAGAAT 2 4 00 

GACTTTAAGG AAAACGGCAC AGAAGGCCGT TCTTTCCAAA GTGAAGACCT CTTCAATCTA 24 60 

GCGGTTCCTG TAGGGATAAA ATTTGAGAAA TTCTCCGATA AGTCTACGTA TGATCTCTCC 2 52 0 

ATAGCTTACG TTCCCGATGT GATTCGTAAT GATCCAGGCT GCACGACAAC TCTTATGGTT 258 0 

TCTGGGGATT CTTGGTCGAC ATGTGGTACA AGCTTGTCTA GACAAGCTCT TCTTGTACGT 2 64 0 

GCTGGAAATC ATCATGCCTT TGCTTCAAAC TTTGAAGTTT TCAGTCAGTT TGAAGTCGAG 27 0 0 

TTGCGAGGTT CTTCTCGTAG CTATGCTATC GATCTTGGAG GAAGATTCGG ATTTTAA 27 5 7 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Arg Ser Ser Phe Ser Leu Leu Leu lie Ser Ser Ser Leu Ala Phe 
15 10 15 

Pro Leu Leu Met Ser Val Ser Ala Asp Ala Ala Asp Leu Thr Leu Gly 
20 25 30 

Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr Thr Glu Phe Thr Pro 
35 40 45 

Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr Tyr lie Leu Asp Gly 
50 55 60 



Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr Ser Leu Thr Thr Ser 
65 70 75 80 
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Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe Leu Gly Asn Gly Phe 
85 90 95 

Ser Leu His Phe Asp Asn lie lie Ser Ser Thr Val Ala Gly Val Val 
100 105 110 

Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys Phe Ser Gly Phe Ser 
115 120 125 

Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr Gly Lys Gly Ala lie 
130 135 140 

Lys lie Thr Asp Gly Leu Val Phe Glu Ser lie Gly Asn Leu Asp Gin 
145 150 155 160 

Asn Glu Asn Ala Ser Ser Glu Asn Gly Gly Ala lie Asn Thr Lys Thr 
165 170 175 

Leu Ser Leu Thr Gly Ser Thr Arg Phe Val Ala Phe Leu Gly Asn Ser 
180 185 190 

Ser Ser Gin Gin Gly Gly Ala lie Tyr Ala Ser . Gly Asp Ser Val lie 
195 200 "•' • 205 

Ser Glu Asn Ala Gly lie Leu Ser Phe Gly Asn Asn Ser Ala Thr Thr 
210 215 220 

Ser Gly Gly Ala lie Ser Ala Glu Gly Asn Leu Val lie Ser Asn Asn 
225 230 235 240 

Gin Asn lie Phe Phe Asp Gly Cys Lys Ala Thr Thr Asn Gly Gly Ala 
245 250 255 

lie Asp Cys Asn Lys Ala Gly Ala Asn Pro Asp Pro lie Leu Thr Leu 
260 265 270 

Ser Gly Asn Glu Ser Leu His Phe Leu Asn Asn Thr Ala Gly Asn Ser 
275 280 285 

Gly Gly Ala lie Tyr Thr Lys Lys Leu Val Leu Ser Ser Gly Arg Gly 
290 295 300 

Gly Val Leu Phe Ser Asn Asn Lys Ala Ala Asn Ala Thr Pro Lys Gly 
305 310 315 320 

Gly Ala lie Ala lie Leu Asp Ser Gly Glu lie Ser lie Ser Ala Asp 
325 330 335 

Leu Gly Asn lie lie Phe Glu Gly Asn Thr Thr Ser Thr Thr Gly Ser 
340 345 350 

Pro Ala Ser Val Thr Arg Asn Ala lie Asp Leu Ala Ser Asn Ala Lys 
355 360 365 

Phe Leu Asn Leu Arg Ala Thr Arg Gly Asn Lys Val lie Phe Tyr Asp 
370 375 380 
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Pro lie Thr Ser Ser Gly Ala Thr Asp Lys Leu Ser Leu Asn Lys Ala 
385 390 395 400 

Asp Ala Gly Ser Gly Asn Thr Tyr Glu Gly Tyr lie Val Phe Ser Gly 
405 410 415 

Glu Lys Leu Ser Glu Glu Glu Leu Lys Lys Pro Asp Asn Leu Lys Ser 
420 425 430 

Thr Phe Thr Gin Ala Val Glu Leu Ala Ala Gly Ala Leu Val Leu Lys 
435 440 445 

Asp Gly Val Thr Val Val Ala Asn Thr lie Thr Gin Val Glu Gly Ser 
450 455 460 

Lys Val Val Met Asp Gly Gly Thr Thr Phe Glu Ala Ser Ala Glu Gly 
465 470 475 480 

Val Thr Leu Asn Gly Leu Ala lie Asn lie Asp Ser Leu Asp Gly Thr 
485 490 495 

Asn Lys Ala lie lie Lys Ala Thr Ala Ala Ser Lys Asp Val Ala Leu 
500 505 ' 510 

Ser Gly Pro lie Met Leu Val Asp Ala Gin Gly Asn Tyr Tyr Glu His 
515 520 525 

His Asn Leu Ser Gin Gin Gin Val Phe Pro Leu lie Glu Leu Ser Ala 
530 535 540 

Gin Gly Thr Met Thr Thr Thr Asp lie Pro Asp Thr Pro lie Leu Asn 
545 550 555 560 

Thr Thr Asn His Tyr Gly Tyr Gin Gly Thr Gly lie lie Val Trp Val 
565 570 575 

Asp Asp Ala Thr Ala Lys Thr Lys Asn Ala Thr Leu Thr Trp Thr Lys 
580 585 590 

Thr Gly Tyr Lys Pro Asn Pro Glu Arg Gin Gly Pro Leu Val Pro Asn 
595 600 605 

Ser Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Ser Leu Met 
610 615 620 

Asp Arg Ser Thr Ser Ser Leu Ser Ser Ser Thr Asn Leu Trp Val Ser 
625 630 635 640 

Gly lie Ala Asp Phe Leu His Glu Asp Gin Lys Gly Asn Gin Arg Ser 
645 650 655 

Tyr Arg His Ser Ser Ala Gly Tyr Ala Leu Gly Gly Gly Phe Phe Thr 
660 665 670 

Ala Ser Glu Asn Phe Phe Asn Phe Ala Phe Cys Gin Leu Phe Gly Tyr 
675 680 685 



Asp Lys Asp His Leu Val Ala Lys Asn His Thr His Val Tyr Ala Gly 
690 695 700 



Ala Met Ser Tyr Arg His Leu Gly Glu Ser Lys Thr Leu Ala Lys lie 
705 710 715 720 

Leu Ser Gly Asn Ser Asp Ser Leu Pro Phe Val Phe Asn Ala Arg Phe 
725 730 735 

Ala Tyr Gly His Thr Asp Asn Asn Met Thr Thr Lys Tyr Thr Gly Tyr 
740 745 750 

Ser Pro Val Lys Gly Ser Trp Gly Asn Asp Ala Phe Gly lie Glu Cys 
755 760 765 

Gly Gly Ala lie Pro Val Val Ala Ser Gly Arg Arg Ser Trp Val Asp 
770 775 780 

Thr His Thr Pro Phe Leu Asn Leu Glu Met lie Tyr Ala His Gin Asn 
785 790 795 800 

Asp Phe Lys Glu Asn Gly Thr Glu Gly Arg Ser Phe Gin Ser Glu Asp 
805 810 815 

Leu Phe Asn Leu Ala Val Pro Val Gly lie Lys Phe Glu Lys Phe Ser 
820 825 830 

Asp Lys Ser Thr Tyr Asp Leu Ser lie Ala Tyr Val Pro Asp Val lie 
835 840 845 

Arg Asn Asp Pro Gly Cys Thr Thr Thr Leu Met Val Ser Gly Asp Ser 
850 ■ 855 860 

Trp Ser Thr Cys Gly Thr Ser Leu Ser Arg Gin Ala Leu Leu Val Arg 
865 870 875 880 

Ala Gly Asn His His Ala Phe Ala Ser Asn Phe Glu Val Phe Ser Gin 
885 890 895 

Phe Glu Val Glu Leu Arg Gly Ser Ser Arg Ser Tyr Ala lie Asp Leu 
900 905 910 

Gly Gly Arg Phe Gly Phe 
915 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ATGAAATCCT CTCTTCATTG GTTTGTAATC TCGTCATCTT TAGCACTTCC CTTGTCACTA 
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AATTTCTCTG CGTTTGCTGC TGTTGTTGAA ATCAATCTAG GACCTACCAA TAGCTTCTCT 12 0 

GGACCAGGAA CCTACACTCC TCCAGCCCAA ACAACAAATG CAGATGGAAC TATCTATAAT 18 0 

CTAACAGGGG ATGTCTCAAT CACCAATGCA GGATCTCCGA CAGCTCTAAC CGCTTCCTGC 24 0 

TTTAAAGAAA CTACTGGGAA TCTTTCTTTC CAAGGCCACG GCTACCT^ATT TCTCCTACAA 30 0 
AATATCGATG CGGGAGCGAA CTGTACCTTT ACCAATACAG CTGCAAATAA GCTTCTCTCC . 3 60 

TTTTCAGGAT TCTCCTATTT GTCACTAATA CAAACCACGA ATGCTACCAC AGGAACAGGA 42 0 

GCCATCAAGT CCACAGGAGC TTGTTCTATT CAGTCGAACT ATAGTTGCTA CTTTGGCCAA 4 80 

AACTTTTCTA ATGACAATGG AGGCGCCCTC CAAGGCAGCT CTATCAGTCT ATCGCTAAAC 54 0 

CCCAACCTAA CGTTTGCCAA AAACAAAGCA ACGCAAAAAG GGGGTGCCCT CTATTCCACG 600 

GGAGGGATTA CAATTAACAA TACGTTAAAC TCAGCATCAT TTTCTGAAAA TACCGCGGCG 660 

AACAATGGCG GAGCCATTTA CACGGAAGCT AGCAGTTTTA TTAGCAGCAA CAAAGCAATT 72 0 

AGCTTTATAA ACAATAGTGT GACCGCAACC TCAGCTACAG GGGGAGCCAT TTACTGTAGT 7 80 

AGTACATCAG CCCCCAAACC AGTCTTAACT CTATCAGACA ACGGGGAACT GAACTTTATA 84 0 

GGAAATACAG CAATTACTAG TGGTGGGGCG ATTTATACTG ACAATCTAGT TCTTTCTTCT 900 

GGAGGACCTA CGCTTTTTAA AAACAACTCT GCTATAGATA CTGCAGCTCC CTTAGGAGGA 960 

GCAATTGCGA TTGCTGACTC TGGATCTTTG AGTCTTTCGG CTCTTGGTGG AGACATCACT 102 0 

TTTGAAGGAA ACACAGTAGT CAAAGGAGCT TCTTCGAGTC AGACCACTAC CAGAAATTCT 10 80 

ATTAACATCG GAAACACCAA TGCTAAGATT GTACAGCTGC GAGCCTCTCA AGGCAATACT 114 0 

ATCTACTTCT ATGATCCTAT AACAACTAAC CATACTGCAG CTCTCTCAGA TGCTCTAAAC 12 00 

TTAAATGGTC CTGACCTTGC AGGGAATCCT GCATATCAAG GAACCATCGT ATTTTCTGGA 12 60 

GAGAAGCTCT CGGAAGCAGA AGCTGCAGAA GCTGATAATC TCAAATCTAC AATTCAGCAA 132 0 

CCTCTAACTC TTGCGGGAGG GCAACTCTCT CTTAAATCAG GAGTCACTCT AGTTGCTAAG 13 8 0 

TCCTTTTCGC AATCTCCGGG CTCTACCCTC CTCATGGATG CAGGGACCAC ATTAGAAACC 14 4 0 

GCTGATGGGA TCACTATCAA TAATCTTGTT CTCAATGTAG ATTCCTTAAA AGAGACCAAG 15 00 

AAGGCTACGC TAAAAGCAAC ACAAGCAAGT CAGACAGTCA CTTTATCTGG ATCGCTCTCT 15 60 

CTTGTAGATC CTTCTGGAAA TGTCTACGAA GATGTCTCTT GGAATAACCC TCAAGTCTTT 162 0 

TCTTGTCTCA CTCTTACTGC TGACGACCCC GCGAATATTC ACATCACAGA CTTAGCTGCT 168 0 

GATCCCCTAG AAAAAAATCC TATCCATTGG GGATACCAAG GGAATTGGGC ATTATCTTGG 17 4 0 

CAAGAGGATA CTGCGACTAA ATCCAAAGCA GCGACTCTTA CCTGGACAAA AACAGGATAC 18 00 

AATCCGAATC CTGAGCGTCG TGGAACCTTA GTTGCTAACA CGCTATGGGG ATCCTTTGTT 18 60 

GATGTGCGCT CCATACAACA GCTTGTAGCC ACTAAAGTAC GCCAATCTCA AGAT^CTCGC 192 0 

GGCATCTGGT GTGAAGGGAT CTCGAACTTC TTCCATAAAG ATAGCACGAA GATAAATAi^A 19 8 0 

GGTTTTCGCC ACATAAGTGC AGGTTATGTT GTAGGAGCGA CTACAACATT AGCTTCTGAT 2 04 0 

AATCTTATCA CTGCAGCCTT CTGCCAATTA TTCGGGAAAG ATAGAGATCA CTTTATAAAT 2100 

AAAAATAGAG CTTCTGCCTA TGCAGCTTCT CTCCATCTCC AGCATCTAGC GACCTTGTCT 2160 

TCTCCAAGCT TGTTACGCTA CCTTCCTGGA TCTGAAAGTG AGCAGCCTGT CCTCTTTGAT 222 0 

GCTCAGATCA GCTATATCTA TAGTAAAAAT ACTATGAAAA CCTATTACAC CCAAGCACCA 22 8 0 

AAGGGAGAGA GCTCGTGGTA TAATGACGGT TGCGCTCTGG AACTTGCGAG CTCCCTACCA 2 34 0 

CACACTGCTT TAAGCCATGA GGGTCTCTTC CACGCGTATT TTCCTTTCAT CAAAGTAGAA 2 4 00 

GCTTCGTACA TACACCAAGA TAGCTTCAAA GAACGTAATA CTACCTTGGT ACGATCTTTC 2 4 60 

GATAGCGGTG ATTTAATTAA CGTCTCTGTG CCTATTGGAA TTACCTTCGA GAGATTCTCG 252 0 

AGAAACGAGC GTGCGTCTTA CGAAGCTACT GTCATCTACG TTGCCGATGT CTATCGTAAG 25 8 0 

AATCCTGACT GCACGACAGC TCTCCTAATC AACAATACCT CGTGGAAAAC TACAGGAACG 2 64 0 

AATCTCTCAA GACAAGCTGG TATCGGAAGA GCAGGGATCT TTTATGCCTT CTCTCCAAAT 27 00 

CTTGAGGTCA CAAGTAACCT ATCTATGGAA ATTCGTGGAT CTTCACGCAG CTACAATGCA 27 60 

GATCTTGGAG GTAAGTTCCA GTTCTAA 27 8 7 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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Met Lys Ser Ser Leu His Trp Phe Val lie Ser Ser Ser Leu Ala Leu 
1 5 10 ' 15 

Pro Leu Ser Leu Asn Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn 
20 25 30 

Leu Gly Pro Thr Asn Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro 
35 40 45 

Ala Gin Thr Thr Asn Ala Asp Gly Thr lie -Tyr Asn Leu Thr Gly Asp 
50 55 60 

Val Ser lie Thr Asn Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys 
65 70 75 80 

Phe Lys Glu Thr Thr Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin 
85 90 95 

Phe Leu Leu Gin Asn lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn 
100 105 110 

Thr Ala Ala Asn Lys Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser 
115 120 125 

Leu lie Gin Thr Thr Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser 
130 135 140 

Thr Gly Ala Cys Ser lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin 
145 150 155 160 

Asn Phe Ser Asn Asp Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser 
165 170 175 

Leu Ser Leu Asn Pro Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin 
180 185 190 

Lys Gly Gly Ala Leu Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr 
195 200 205 

Leu Asn Ser Ala Ser Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly 
210 215 220 

Ala lie Tyr Thr Glu Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie 
225 230 235 240 

Ser Phe lie Asn Asn Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala 
245 250 255 

lie Tyr Cys Ser Ser Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser 
260 265 270 

Asp Asn Gly Glu Leu Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly 
275 280 285 

Gly Ala lie Tyr Thr Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr 
290 295 300 
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Leu Phe Lys Asn Asn Ser Ala lie Asp Thr Ala Ala Pro Leu Gly Gly 
305 310 315 320 

Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly 
325 330 335 

Gly Asp lie Thr Phe Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser 
340 345 350 

Ser Gin Thr Thr Thr Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala 
355 360 365 

Lys lie Val Gin Leu Arg Ala Ser Gin Gly Asn Thr lie Tyr Phe Tyr 
370 375 380 

Asp Pro lie Thr Thr Asn His Thr Ala Ala Leu Ser Asp Ala Leu Asn 
385 390 395 400 

Leu Asn Gly Pro Asp Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr lie 
405 - 410 415 

Val Phe Ser Gly Glu Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp 
420 425 430 

Asn Leu Lys Ser Thr lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin 
435 440 445 

Leu Ser Leu Lys Ser Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin 
450 455 460 

Ser Pro Gly Ser Thr Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr 
465 470 475 480 

Ala Asp Gly lie Thr lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu 
485 490 495 

Lys Glu Thr Lys Lys Ala Thr Leu Lys Ala Thr Gin Ala. Ser Gin Thr 
500 505 510 

Val Thr Leu Ser Gly Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val 
515 520 ^ 525 

Tyr Glu Asp Val Ser Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr 
530 535 540 

Leu Thr Ala Asp Asp Pro Ala Asn lie His lie Thr Asp Leu Ala Ala 
545 550 555 560 

Asp Pro Leu Glu Lys Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp 
565 570 575 

Ala Leu Ser Trp Gin Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr 
580 585 590 

Leu Thr Trp Thr Lys Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly 
595 600 605 
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Thr Leu Val Ala Asn Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser 
610 615 620 

lie Gin Gin Leu Val Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg 
625 630 635 640 

Gly lie Trp Cys Glu Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr 
645 650 655 

Lys lie Asn Lys Gly Phe Arg His lie Ser Ala Gly Tyr Val Val Gly 
660 665 670 

Ala Thr Thr Thr Leu Ala Ser Asp Asn Leu lie Thr Ala Ala Phe Cys 
675 . 680 685 

Gin Leu Phe Gly Lys Asp Arg Asp His Phe lie Asn Lys Asn Arg Ala 
690 695 700 

Ser Ala Tyr Ala Ala Ser Leu His Leu Gin His Leu Ala Thr Leu Ser 
705 710 715 720 

Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro 
725 730 735 

Val Leu Phe Asp Ala Gin lie Ser Tyr lie Tyr Ser Lys Asn Thr Met 

740 745 750 

Lys Thr Tyr Tyr Thr Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn 

755 760 765 



Ser Ser Leu Pro His Thr Ala Leu 
780 



Asp Gly Cys Ala 
.770 

Ser His Glu Gly 
785 

Ala Ser Tyr lie 



Leu Glu Leu Ala 
775 

Leu Phe His Ala 
790 

His Gin Asp Ser 
805 



Tyr Phe Pro Phe 
795 

Phe Lys Glu Arg 
810 



lie Lys Val Glu 
800 

Asn Thr Thr Leu 
815 

Ser Val Pro lie 
830 

Ala Ser Tyr Glu 
845 

Asn Pro Asp Cys 



Thr Thr Gly Thr 
880 

lie Phe Tyr Ala 
895 



Val Arg Ser Phe 
820 

Gly lie Thr Phe 
835 

Ala Thr Val lie 
850 

Thr Thr Ala Leu 
865 

Asn Leu Ser Arg 



Asp Ser Gly Asp 



Glu Arg Phe Ser 
840 

Tyr Val Ala Asp 
855 

Leu lie Asn Asn 
870 

Gin Ala Gly lie 
885 



Leu lie Asn Val 
825 

Arg Asn Glu Arg 



Val Tyr Arg Lys 
860 

Thr Ser Trp Lys 
875 

Gly Arg Ala Gly 
890 



Phe Ser Pro Asn Leu Glu Val Thr Ser Asn Leu Ser Met Glu lie Arg 
900 905 910 



Gly Ser Ser Arg Ser Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 
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(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



ATGAAAATAC CCTTGCACAA ACTCCTGATC TCTTCGACTC TTGTCACTCC CATTCTATTG 60 

AGCATTGCAA CTTACGGAGC AGATGCTTCT TTATCCCCTA CAGATAGCTT TGATGGAGCG 12 0 

GGCGGCTCTA CATTTACTCC AAAATCTACA GCAGATGCCA ATGGAACGAA CTATGTCTTA 18 0 

TCAGGAAATG TCTATATAAA CGATGCTGGG AAAGGCACAG CATTAACAGG CTGCTGCTTT 24 0 

ACAGAAACTA CGGGTGATCT GACATTTACT GGAAAGGGAT ACTCATTTTC ATTCAACACG 300 

GTAGATGCGG GTTCGAATGC AGGAGCTGCG GCAAGCACAA CTGCTGATAA AGCCCTAACA 360 

TTCACAGGAT TTTCTAACCT TTCCTTCATT GCAGCTCCTG GAACTACAGT TGCTTCAGGA 42 0 

AAAAGTACTT TAAGTTCTGC AGGAGCCTTA AATCTTACCG AT7\ATGGAAC GATTCTCTTT 48 0 

AGCCAAAACG TCTCCAATGA AGCTAATAAC AATGGCGGAG CGATCACCAC AAAAACTCTT 54 0 

TCTATTTCTG GGAATACCTC TTCTATAACC TTCACTAGTA ATAGCGCAAA AAAATTAGGT 60 0 

GGAGCGATCT ATAGCTCTGC GGCTGCAAGT ATTTCAGGAA ACACCGGCCA GTTAGTCTTT 660 

ATGAATAATA AAGGAGAAAC TGGGGGCGGG GCTCTGGGCT TTGAAGCCAG CTCCTCGATT 72 0 

ACTCAAAATA GCTCCCTTTT CTTCTCTGGA AACACTGCAA CAGATGCTGC AGGCAAGGGC '78 0 

GGGGCCATTT ATTGTGAAAA AACAGGAGAG ACTCCTACTC TTACTATCTC TGGAAATAAA 84 0 

AGTCTGACCT TCGCCGAGAA CTCTTCAGTA ACTCAAGGCG GAGCAATCTG TGCCCATGGT 90 0 

CTAGATCTTT CCGCTGCTGG CCCTACCCTA TTTTCAAATA ATAGATGCGG GAACACAGCT 960 

GCAGGCAAGG GCGGCGCTAT TGCAATTGCC GACTCTGGAT CTTTAAGTCT CTCTGCAAAT 102 0 

CAAGGAGACA TCACGTTCCT TGGCAACACT CTAACCTCAA CCTCCGCGCC AACATCGACA 108 0 

CGGAATGCTA TCTACCTGGG ATCGTCAGCA AAAATTACGA ACTTAAGGGC AGCCCAAGGC 114 0 

CAATCTATCT ATTTCTATGA TCCGATTGCA TCTAACACCA CAGGAGCTTC AGACGTTCTG 12 0 0 

ACCATCAACC AACCGGATAG CAACTCGCCT TTAGATTATT CAGGAACGAT TGTATTTTCT 12 60 

GGGGAAAAGC TCTCTGCAGA TGAAGCG7\AA GCTGCTGATA ACTTCACATC TATATTAAAG 132 0 

CAACCATTGG CTCTAGCCTC TGGAACCTTA GCACTCAAAG GAAATGTCGA GTTAGATGTC 138 0 

AATGGTTTCA CACAGACTGA AGGCTCTACA CTCCTCATGC AACCAGGT^C AAAGCTCAAA 14 4 0 

GCAGATACTG AAGCTATCAG TCTTACCAAA CTTGTCGTTG ATCTTTCTGC CTTAGAGGGA 15 0 0 

AATAAGAGTG TGTCCATTGA AACAGCAGGA GCCAACAAAA CTATAACTCT AACCTCTCCT 1560 

CTTGTTTTCC AAGATAGTAG CGGCi^ATTTT TATGAAAGCC ATACGATAAA CCAAGCCTTC 162 0 

ACGCAGCCTT TGGTGGTATT CACTGCTGCT ACTGCTGCTA GCGATATTTA TATCGATGCG 168 0 

CTTCTCACTT CTCCAGTACA AACTCCAGAA CCTCATTACG GGTATCAGGG ACATTGGGAA 174 0 

GCCACTTGGG CAGACACATC AACTGCAAAA TCAGGAACTA TGACTTGGGT AACTACGGGC 18 00 

TACAACCCTA ATCCTGAGCG TAGAGCTTCC GTAGTTCCCG ATTCATTATG GGCATCCTTT 18 60 

ACTGACATTC GCACTCTACA GCAGATCATG ACATCTCAAG CGAATAGTAT CTATCAGCAA 192 0 

CGAGGACTCT GGGCATCAGG AACTGCGAAT TTCTTCCATA AGGATAAATC AGGAACTAAC 198 0 

CAAGCATTCC GACATAAAAG CTACGGCTAT ATTGTTGGAG GAAGTGCTGA AGATTTTTCT 204 0 

GAAAATATCT TCAGTGTAGC TTTCTGCCAG CTCTTCGGTA AAGATAAAGA CCTGTTTATA 2100 

GTTGAAAATA CCTCTCATAA CTATTTAGCG TCGCTATACC TGCAACATCG AGCATTCCTA 2160 

GGAGGACTTC CCATGCCCTC ATTTGGAAGT ATCACCGACA TGCTGAAAGA TATTCCTCTC 2220 

ATTTTGAATG CCCAGCTAAG CTACAGCTAC ACTAAAAATG ATATGGATAC TCGCTATACT 22 8 0 

TCCTATCCTG AAGCTCAAGG TTCTTGGACC AATAATTCTG GGGCTCTAGA GCTCGGAGGA 234 0 

TCTCTGGCTC TATATCTCCC TAAAGAAGCA CCGTTCTTCC AGGGATATTT CCCCTTCTTA 2 4 00 

AAGTTCCAGG CAGTCTACAG CCGCCAACAA AACTTTAAAG AGAGTGGCGC TGAAGCCCGT 2 4 60 

GCTTTTGATG ATGGAGACCT AGTGAACTGC TCTATCCCTG TCGGCATTCG GTTAGAAAAA 252 0 

ATCTCCGAAG ATGAAAAAAA TAATTTCGAG ATTTCTCTAG CCAACATTGG TGATGTGTAT 258 0 

CGTAAAAATC CCCGTTCGCG TACTTCTCTA ATGGTCAGTG GAGCCTCTTG GACTTCGCTA 264 0 

TGTAAAAACC TCGCACGACA AGCCTTCTTA GCAAGTGCTG GAAGCCATCT GACTCTCTCC 27 0 0 
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CCTCATGTAG AACTCTCTGG GGAA.GCTGCT TATGAGCTTC GTGGCTCAGC ACACATCTAC 27 60 
AATGTAGATT GTGGGCTAAG ATACTCATTC TAG 27 93 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Lys lie Pro Leu His Lys Leu Leu lie Ser Ser Thr Leu Val Thr 
1 5 10 15 

Pro lie Leu Leu Ser lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser 
20 25 30 

Pro Thr Asp Ser Phe Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys 
35 40 ' 45 

Ser Thr Ala Asp Ala Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val 

50 55 60 

Tyr lie Asn Asp Ala Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe 

65 70 75 80 

Thr Glu Thr Thr Gly Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe 
85 90 95 

Ser Phe Asn Thr Val Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser 
100 105 110 

Thr Thr Ala Asp Lys Ala Leu Thr Phe Thr Gly Phe Ser Asn Leu Ser 
115 120 125 

Phe lie Ala Ala Pro Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu 
130 135 140 

Ser Ser Ala Gly Ala Leu Asn Leu Thr Asp Asn Gly Thr lie Leu Phe 
145 150 155 160 

Ser Gin Asn Val Ser Asn Glu Ala Asn Asn Asn Gly Gly Ala lie Thr 
165 170 175 

Thr Lys Thr Leu Ser lie Ser Gly Asn Thr Ser Ser lie Thr Phe Thr 
180 185 190 

Ser Asn Ser Ala Lys Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala 
195 200 205 

Ala Ser lie Ser Gly Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys 
210 215 220 



Gly Glu Thr Gly Gly Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser lie 
225 230 235 240 
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Thr Gin Asn Ser Ser Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala 
245 250 255 

Ala Gly Lys Gly Gly Ala lie Tyr Cys Glu Lys Thr Gly Glu Thr Pro 
260 265 270 

Thr Leu Thr lie Ser Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser 
275 280 285 

Ser Val Thr Gin Gly Gly Ala lie Cys Ala His Gly Leu Asp Leu Ser 
290 295 300 

Ala Ala Gly Pro Thr Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala 
305 310 315 320 

Ala Gly Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser 
325 330 335 

Leu Ser Ala Asn Gin Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr 
340 345 350 

Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser 
355 360 365 

Ser Ala Lys lie Thr Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr 
370 375 380 

Phe Tyr Asp Pro lie Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu 
385 390 395 400 

Thr lie Asn Gin Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr 
405 410 415 

lie Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala 
420 425 430 

Asp Asn Phe Thr Ser lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly 
435 440 445 

Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr 
450 455 460 

Gin Thr Glu Gly Ser Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys 
465 470 475 480 

Ala Asp Thr Glu Ala lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser 
485 490 495 

Ala Leu Glu Gly Asn Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn 
500 505 510 

Lys Thr lie Thr Leu Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly 
515 520 525 

Asn > Phe Tyr Glu Ser His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu 
530 535 540 
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Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp lie Tyr lie Asp Ala 
545 550 555 560 

Leu Leu Thr Ser Pro Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin 
565 570 575 

Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly 
580 585 590 

Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg 
595 600 605 

Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg 
610 615 620 

Thr Leu Gin Gin lie Met Thr Ser Gin Ala Asn Ser lie Tyr Gin Gin 
625 630 635 640 

Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys 
645 650 655 

Ser Gly Thr Asn Gin Ala Phe Arg His Lys Ser Tyr Gly Tyr lie Val 
660 665 670 

Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn lie Phe Ser Val Ala Phe 
675 680 685 

Cys Gin Leu Phe Gly Lys Asp Lys Asp Leu Phe lie Val Glu Asn Thr 
690 695 700 

Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu 
705 710 715 ' 720 

Gly Gly Leu Pro Met Pro Ser Phe Gly Ser lie Thr Asp Met Leu Lys 
725 730 735 

Asp lie Pro Leu lie Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys 
740 745 750 

Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser 
755 760 765 

Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu 
770 775 780 

Tyr Leu Pro Lys Glu Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu 
785 790 795 800 

Lys Phe Gin Ala Val Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly 
805 810 815 

Ala Glu Ala Arg Ala Phe Asp Asp Gly Asp Leu Val Asn Cys Ser lie 
820 825 830 



Pro Val Gly lie Arg Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn 
835 840 845 
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Phe Glu lie Ser Leu Ala Asn lie Gly Asp Val Tyr Arg Lys Asn Pro 
850 855 860 

Arg Ser Arg Thr Ser Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu 
865 870 875 880 

Cys Lys Asn Leu Ala Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His 
885 890 895 

Leu Thr Leu Ser Pro His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu 
900 905 910 

Leu Arg Gly Ser Ala His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr 
915 920 925 

Ser Phe 
930 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 base pairs 
{B)"TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAAGACAATA TAAGGTACCG TCATAACAGC GGGGGTTATG CACTAGGGAT CACAGCAACA 60 

ACTCCTGCCG AGGATCAGCT TACTTTTGCC TTCTGCCAGC TCTTTGCTAG AGATCGCAAT 12 0 

CATATTACAG GTAAGAACCA CGGAGATACT TACGGTGCCT CTTTGTATTT CCACCATACA 18 0 

GAAGGGCTCT TCGACATCGC CAATTTCCTC TGGGGAAAAG CAACCCGAGC TCCCTGGGTG 24 0 

CTCTCTGAGA TCTCCCAGAT CATTCCTTTA TCGTTCGATG CTAAATTCAG TTATCTCCAT 300 

ACAGACAACC ACATGAAGAC ATATTATACC GATAACTCTA TCATCAAGGG TTCTTGGAGA .360 

AACGATGCCT TCTGTGCAGA TCTTGGAGCT AGCCTGCCTT TTGTTATTTC CGTTCCGTAT 42 0 

CTTCTGAAAG AAGTCGAACC TTTTGTCAAA GTACAGTATA TCTATGCGCA TCAGCAAGAC 4 80 

TTCTACGAGC GTCATGCTGA AGGACGCGCT TTCAATAAAA GCGAGCTTAT CAACGTAGAG 54 0 

ATTCCTATAG GCGTCACCTT CGAAAGAGAC TCAAAATCAG AAAAGGGAAC TTACGATCTT 600 

ACTCTTATGT ATATACTCGA TGCTTACCGA CGCAATCCTA AATGTCAAAC TTCCCTAATA 660 

GCTAGCGATG CTAACTGGAT GGCCTATGGT ACCAACCTCG CACGACAAGG TTTTTCTGTT 72 0 

CGTGCTGCGA ACCATTTCCA AGTGAACCCC CACATGGAAA TCTTCGGTCA ATTCGCTTTT 78 0 

GAAGTACGAA GTTCTTCACG AAATTATAAT ACAAACCTAG GCTCTAAGTT TTGTTTCTAG 84 0 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly Tyr Ala Leu Gly 
15 10 15 

lie Thr Ala Thr Thr Pro Ala Glu Asp Gin Leu Thr Phe Ala Phe Cys 
20 25 30 

Gin Leu Phe Ala Arg Asp Arg Asn His lie Thr Gly Lys Asn His Gly 
35 40 45 

Asp Thr Tyr Gly Ala Ser Leu Tyr Phe His His Thr Glu Gly Leu Phe 
50 55 60 

Asp lie Ala Asn Phe Leu Trp Gly Lys Ala Thr Arg Ala Pro Trp Val 
65 70 75 80 

Leu Ser Glu lie Ser Gin lie lie Pro Leu Ser Phe Asp Ala Lys Phe 
85 90 95 

Ser Tyr Leu His Thr Asp Asn His Met Lys Thr Tyr Tyr Thr Asp Asn 
100 105 110 

Ser lie lie Lys Gly Ser Trp Arg Asn Asp Ala Phe Cys Ala Asp Leu 
115 120 125 

Gly Ala Ser Leu Pro Phe Val lie Ser Val Pro Tyr Leu Leu Lys Glu 
130 135 140 

Val Glu Pro Phe Val Lys Val Gin Tyr lie Tyr Ala His Gin Gin Asp 
145 150 155 160 

Phe Tyr Glu Arg His Ala Glu Gly Arg Ala Phe Asn Lys Ser Glu Leu 
165 170 175 

lie Asn Val Glu lie Pro lie Gly Val Thr Phe Glu Arg Asp Ser Lys 
180 185 190 

Ser Glu Lys Gly Thr Tyr Asp Leu Thr Leu Met Tyr lie Leu Asp Ala 
195 200 205 

Tyr Arg Arg Asn Pro Lys Cys Gin Thr Ser Leu lie Ala Ser Asp Ala 
210 215 220 

Asn Trp Met Ala Tyr Gly Thr Asn Leu Ala Arg Gin Gly Phe Ser Val 
225 230 235 240 

Arg Ala Ala Asn His Phe Gin Val Asn Pro His Met Glu lie Phe Gly 
245 250 255 

Gin Phe Ala Phe Glu Val Arg Ser Ser Ser Arg Asn Tyr Asn Thr Asn 
260 265 270 

Leu Gly Ser Lys Phe Cys Phe 
275 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) 



SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1545 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATGACCATAC TTCGAAATTT TCTTACCTGC TCGGCTTTAT TCCTCGCTCT CCCTGCAGCA 60 

GCACAAGTTG TATATCTTCA TGAAAGTGAT GGTTATAACG GTGCTATCAA TAATAAAAGC 12 0 

TTAGAACCTA AAATTACCTG TTATCCAGAA GGAACTTCTT ACATCTTTCT AGATGACGTG 18 0 

AGGATTTCCA ACGTTAAGCA TGATCAAGAA GATGCTGGGG TTTTTATAAA TCGATCTGGG 240 

AATCTTTTTT TCATGGGCAA CCGTTGCAAC TTCACTTTTC ACAACCTTAT GACCGAGGGT 300 

TTTGGCGCTG CCATTTCGAA CCGCGTTGGA GACACCACTC TCACTCTCTC TAATTTTTCT 360 

TACTTAACGT TCACCTCAGC ACCTCTACTA CCTCAAGGAC AAGGAGCGAT TTATAGTCTT 42 0 

GGTTCCGTGA TGATCGAAAA TAGTGAGGAA GTGACTTTCT GTGGGAACTA CTCTTCGTGG 4 80 

AGTGGAGCTG CGATTTATAC TCCCTACCTT TTAGGTTCTA AGGCGAGTCG TCCTTCAGTA 54 0 

AATCTCAGCG GGAACCGCTA CCTGGTGTTT AGAGACTATG TGAGCCAAGG TTATGGCGGC 600 

GCCGTATCTA CCCACAATCT CACACTCACG ACTCGAGGAC CTTCGTGTTT TGAAAATAAT 660 

CATGCTTATC ATGACGTGAA TAGTAATGGA GGAGCCATTG CCATTGCTCC TGGAGGATCG 72 0 

ATCTCTATAT CCGTGAAAAG CGGAGATCTC ATCTTCAAAG GAAATACAGC ATCACAAGAC 78 0 

GGAAATACAA TACACAACTC CATCCATCTG CAATCTGGAG CACAGTTT7VA GAACCTACGT 84 0 

GCTGTTTCAG AATCCGGAGT TTATTTCTAT GATCCTATAA GCCATAGCGA GTCGCATAAA 900" 

ATTACAGATC TTGTAATCAA TGCTCCTGAA GGAAAGGAAA CTTATGAAGG AACAATTAGC 960 

TTCTCAGGAC TATGCCTGGA TGATCATGAA GTTTGTGCGG AAAATCTTAC TTCCACAATC 102 0 

CTACAAGATG TCACATTAGC AGGAGGAACT CTCTCTCTAT CGGATGGGGT TACCTTGCAA 108 0 

CTGCATTCTT TTAAGCAGGA AGCAAGCTCT ACGCTTACTA TGTCTCCAGG AACCACTCTG 114 0 

CTCTGCTCAG GAGATGCTCG GGTTCAGAAT CTGCACATCC TGATTGAAGA TACCGACAAC 12 0 0 

TTTGTTCCTG TAAGGATTCG CGCCGAGGAC AAGGATGCTC TTGTCTCATT AGAAAAACTT 12 60 

AAAGTTGCCT TTGAGGCTTA TTGGTCCGTC TATGACTTTC CTCAATTTAA GGAAGCCTTT 132 0 

ACGATTCCTC TTCTTGAACT TCTAGGGCCT TCTTTTGACA GTCTTCTCCT AGGGGAGACC 138 0 

ACTTTGGAGA GAACCCAAGT CACAACAGAG AATGACGCCG TTCGAGGTTT CTGGTCCCTA 144 0 

AGCTGGGAAG AGTACCCCCC TTCTCTGGAT AAAGACAGAA GGATCACACC AACTAAGAAA 150 0 

ACTGTTTTCC TCACTTGGAA TCCTGAGATC ACTTCTACGC CATAA 154 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Thr lie Leu Arg Asn Phe Leu Thr Cys Ser Ala Leu Phe Leu Ala 
15 10 15 

Leu Pro Ala Ala Ala Gin Val Val Tyr Leu His Glu Ser Asp Gly Tyr 
20 25 30 

Asn Gly Ala lie Asn Asn Lys Ser Leu Glu Pro Lys lie Thr Cys Tyr 
35 40 45 



Pro Glu Gly Thr Ser Tyr lie Phe Leu Asp Asp Val Arg lie Ser Asn 
50 55 60 
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Val Lys His Asp Gin Glu Asp Ala Gly Val Phe lie Asn Arg Ser Gly 
65 70 75 80 

Asn Leu Phe - Phe Met Gly Asn Arg Cys Asn Phe Thr Phe His Asn Leu 
85 ■ 90 95 

Met Thr Glu Gly Phe Gly Ala Ala lie Ser Asn Arg Val Gly Asp Thr 
100 105 110 

Thr Leu Thr Leu Ser Asn Phe Ser Tyr Leu Thr Phe Thr Ser Ala Pro 
115 120 125 

Leu Leu Pro Gin Gly Gin Gly Ala lie Tyr Ser Leu Gly Ser Val Met 
130 135 140 

lie Glu Asn Ser Glu Glu Val Thr Phe Cys Gly Asn Tyr Ser Ser Trp 
145 150 155 160 

Ser Gly Ala Ala lie Tyr Thr Pro Tyr Leu Leu Gly Ser Lys Ala Ser 
165 170 175 

Arg Pro Ser Val Asn Leu Ser Gly Asn Arg Tyr Leu Val Phe Arg Asp 
180 185 190 

Tyr Val Ser Gin Gly Tyr Gly Gly Ala Val Ser Thr His Asn Leu Thr 
195 200 205 

Leu Thr Thr Arg Gly Pro Ser Cys Phe Glu Asn Asn His Ala Tyr His 
210 215 220 

Asp Val Asn Ser Asn Gly Gly Ala lie Ala lie Ala Pro Gly Gly Ser 
225 230 235 240 

lie Ser lie Ser Val Lys Ser Gly Asp Leu lie Phe Lys Gly Asn Thr 
245 250 255 

Ala Ser Gin Asp Gly Asn Thr lie His Asn Ser lie His Leu Gin Ser 
260 265 270 

Gly Ala Gin Phe Lys Asn Leu Arg Ala Val Ser Glu Ser Gly Val Tyr 
275 280 285 

Phe Tyr Asp Pro lie Ser His Ser Glu Ser His Lys lie Thr Asp Leu 
290 295 300 

Val lie Asn Ala Pro Glu Gly Lys Glu Thr Tyr Glu Gly Thr lie Ser 
305 310 315 320 

Phe Ser Gly Leu Cys Leu Asp Asp His Glu Val Cys Ala Glu Asn Leu 
325 330 335 

Thr Ser Thr lie Leu Gin Asp Val Thr Leu Ala Gly Gly Thr Leu Ser 
340 345 350 



Leu Ser Asp Gly Val Thr Leu Gin Leu His Ser Phe Lys Gin Glu Ala 
355 360 365 
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Ser Ser Thr Leu Thr Met Ser Pro Gly Thr Thr Leu Leu Cys Ser Gly 
370 375 380 

Asp Ala Arg Val Gin Asn Leu His lie Leu lie Glu Asp Thr 'Asp Asn 
385 390 395 400 

Phe Val Pro Val Arg lie Arg Ala Glu Asp Lys Asp Ala Leu Val Ser 
405 410 415 

Leu Glu Lys Leu Lys Val Ala Phe Glu Ala Tyr Trp Ser Val Tyr Asp 
420 425 430 

Phe Pro Gin Phe Lys Glu Ala Phe Thr lie Pro Leu Leu Glu Leu Leu 
435 440 445 

Gly Pro Ser Phe Asp Ser Leu Leu Leu Gly Glu Thr Thr Leu Glu Arg 
450 455 460 

Thr Gin Val Thr Thr Glu Asn Asp Ala Val Arg Gly Phe Trp Ser Leu 
465 470 475 480 

Ser Trp Glu Glu Tyr Pro Pro Ser Leu Asp Lys Asp Arg Arg lie Thr 
485 490 " 495 

Pro Thr Lys Lys Thr Val Phe Leu Thr Trp Asn Pro Glu lie Thr Ser 
500 505 510 

Thr Pro 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATGAAAACGT CTATTCGTAA GTTCTTAATT TCTACCACAC TGGCGCCATG TTTTGCTTCA 60 

ACAGCGTTTA CTGTAGAAGT TATCATGCCT TCCGAGAACT TTGATGGATC GAGTGGGAAG 120 

ATTTTTCCTT ACACAACACT TTCTGATCCT AGAGGGACAC TCTGTATTTT TTCAGGGGAT 180 

CTCTACATTG CGAATCTTGA TAATGCCATA TCCAGAACCT CTTCCAGTTG CTTTAGCAAT 24 0 

AGGGCGGGAG CACTACAAAT CTTAGGAAAA GGTGGGGTTT TCTCCTTCTT AAATATCCGT 300 

TCTTCAGCTG ACGGAGCCGC GATTAGTAGT GTAATCACCC AAAATCCTGA ACTATGTCCC 360 

TTGAGTTTTT CAGGATTTAG TCAGATGATC TTCGATAACT GTGAATCTTT GACTTCAGAT 42 0 

ACCTCAGCGA GTAATGTCAT ACCTCACGCA TCGGCGATTT ACGCTACAAC GCCCATGCTC 4 80 

TTTACAAACA ATGACTCCAT ACTATTCCAA TACAACCGTT CTGCAGGATT TGGAGCTGCC 54 0 

ATTCGAGGCA CAAGCATCAC AATAGAAAAT ACGAAAAAGA GCCTTCTCTT TAATGGTAAT 600 

GGATCCATCT CTAATGGAGG GGCCCTCACG GGATCTGCAG CGATCAACCT CATCT^CAAT 660 

AGCGCTCCTG TGATTTTCTC AACGAATGCT ACAGGGATCT ATGGTGGGGC TATTTACCTT 72 0 

ACCGGAGGAT CTATGCTCAC CTCTGGGAAC CTCTCAGGAG TCTTGTTCGT TTATAATAGC 7 80 

TCGCGCT 7 87 



(2) INFORMATION FOR SEQ ID NO: 22: 



78 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Lys Thr Ser lie Arg Lys Phe Leu lie Ser Thr Thr Leu Ala Pro 
1 5 10 15 

Cys Phe Ala Ser Thr Ala Phe Thr Val Glu Val lie Met Pro Ser Glu 
20 25 30 

Asn Phe Asp Gly Ser Ser Gly Lys lie Phe Pro Tyr Thr Thr Leu Ser 
35 40 45 

Asp Pro Arg Gly Thr Leu Cys lie Phe Ser Gly Asp Leu Tyr lie Ala 
50 55 60 

Asn Leu Asp Asn Ala lie Ser Arg Thr Ser Ser Ser Cys Phe Ser Asn 
65 70 75 80 

Arg Ala Gly Ala Leu Gin lie Leu Gly Lys Gly Gly Val Phe Ser Phe 
85 90 95 

Leu Asn lie Arg Ser Ser Ala Asp Gly Ala Ala lie Ser Ser Val lie 
100 105 110 

Thr Gin Asn Pro Glu Leu Cys Pro Leu Ser Phe Ser Gly Phe Ser Gin 
115 120 125 

Met lie Phe Asp Asn Cys Glu Ser Leu Thr Ser Asp Thr Ser Ala Ser 
130 135 140 

Asn Val lie Pro His Ala Ser Ala lie Tyr Ala Thr Thr Pro Met Leu 
145 150 155- 160 

Phe Thr Asn Asn Asp Ser lie Leu Phe Gin Tyr Asn Arg Ser Ala Gly 
165 170 175 

Phe Gly Ala Ala lie Arg Gly Thr Ser lie Thr lie Glu Asn Thr Lys 
180 185 190 

Lys Ser Leu Leu Phe Asn Gly Asn Gly Ser lie Ser Asn Gly Gly Ala 
195 200 205 

Leu Thr Gly Ser Ala Ala lie Asn Leu lie Asn Asn Ser Ala Pro Val 
210 215 220 

lie Phe Ser Thr Asn Ala Thr Gly lie Tyr Gly Gly Ala lie Tyr Leu 
225 230 * 235 240 

Thr Gly Gly Ser Met Leu Thr Ser Gly Asn Leu Ser Gly Val Leu Phe 
245 250 255 
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Val Tyr Asn Ser Ser Arg 
260 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



ATGAAGACTT CAGTTTCTAT GTTGTTGGCC CTGCTTTGCT CGGGGGCTAG CTCTATTGTA 60 

CTCCATGCCG CAACCACTCC ACTAAATCCT GAAGATGGGT TTATTGGGGA GGGCAATACA 12 0 

AATACTTTTT CTCCGAAATC TACAACGGAT GCTGCAGGAA CTACCTACTC TCTCACAGGA 18 0 

GAGGTTCTGT TTATAGATCC GGGGAAAGGT GGTTCAATTA CAGGAACTTG CTTTGTAGAA 24 0 

ACTGCTGGCG ATCTTACATT TTTAGGTAAT GGAAATACCC TAAAGTTCCT GTCGGTAGAT 300 

GCAGGTGCTA ATATCGCGGT TGCTCATGTA CAAGGAAGTA AGAATTTAAG CTTCACAGAT 360 

TTCCTTTCTC TGGTGATCAC AGAATCTCCA AAATCCGCTG TTAGTACAGG AAAAGGTAGC 42 0 

' CTAGTCAGTT CAGGTGCAGT CCAACTGCAA GATATAAACA CTCTAGTTCT TACAAGCAAT 4 80 

GCCTCTGTCG AAGATGGTGG CGTGATTAAA GGAAACTCCT GCTTGATTCA GGGAATCAAA 54 0 

AATAGTGCGA TTTTTGGACA AAATACATCT TCGAAAAAAG GAGGGGCGAT CTCCACGACT 600 

CAAGGACTCA CCATAGAGAA TAACTTAGGG ACGCTAAAGT TCAATGAAAA CAAAGCAGTG 660 

ACCTCAGGAG GCGCCTTAGA TTTAGGAGCC GCGTCTACAT TCACTGCGAA CCATGAGTTG 72 0 

ATATTTTCAC AAAATAAGAC TTCTGGGAAT GCTGCAAATG GCGGAGCCAT AAATTGCTCA 7 80 

GGCGACCTAA CATTTACTGA TAACACTTCT TTGTTACTTC AAGAAAATAG CACAATGCAG 84 0 

GATGGTGGAG CTTTGTGTAG CACAGGAACC ATAAGCATTA CCGGTAGTGA TTCTATCAAT 90 0 

GTGATAGGAA ATACTTCAGG ACAAAAAGGA GGAGCGATTT CTGCAGCTTC TCTCAAGATT 9 60 

TTGGGAGGGC AGGGAGGCGC TCTCTTTTCT AATAACGTAG TGACTCATGC CACCCCTCTA 102 0 

GGAGGTGCCA TTTTTATCAA CACAGGAGGA TCCTTGCAGC TCTTCACTCA AGGAGGGGAT 108 0 

ATCGTATTCG AGGGGAATCA GGTCACTACA ACAGCTCCAA ATGCTACCAC TAAGAGAAAT 114 0 

GTAATTCACC TCGAGAGCAC CGCGAAGTGG ACGGGACTTG CTGCAAGTCA AGGTAACGCT 12 00 

ATCTATTTCT ATGATCCCAT TACCACCAAC GATACGGGAG CAAGCGATAA CTTACGTATC 12 60 

AATGAGGTCA GTGCAAATCA AAAGCTCTCG GGATCTATAG TATTTTCTGG AGAGAGATTG 132 0 

TCGACAGCAG AAGCTATAGC TGAAAATCTT ACTTCGAGGA TCAACCAGCC TGTCACTTTA 138 0 

GTAGAGGGGA GCTTAGAACT TAAACAGGGA GTGACCTTGA TCACACAAGG ATTCTCGCAG 14 4 0 

GAGCCAGAAT CCACGCTTCT TTTGGATTTG GGGACCTCAT TACAAGCTTC TACAGAAGAT 150 0 

ATCGTCATCA CAAATTCATC TATAAATGCC GATACCATTT ACGGAAAGAA TCCAATCAAT 15 60 

ATTGTAGCTT CAGCAGCGAA TAAGAACATT ACCCTAACAG GAACCTTAGC ACTTGTAAAT 162 0 

GCAGATGGAG CTTTGTATGA GAACCATACC TTGCAAGACT CTCAAGATTA TAGCTTTGTA 168 0 

AAGTTATCTC CAGGAGCGGG AGGGACTATA ATTACTCAAG ATGCTTCTCA GAAGCTTCTT 17 4 0 

GAAGTAGCTC CTTCTAGACC ACATTATGGC TATCAAGGAC ATTGGAATGT GCAAGTCATC 18 00 

CCAGGAACGG GAACTCAACC GAGCCAGGCA AATTTAGAAT GGGTGCGGAC AGGATACCTT 18 60 

CCGAATCCCG AACGGCAAGG ATTTTTAGTT CCCAATAGCC TGTGGGGTTC TTTTGTTGAT 192 0 

CAGCGTGCTA TCCAAGAAAT CATGGTAAAT AGTAGCCAAA TCTTATGTCA GGAACGGGGA 198 0 

GTCTGGGGAG CTGGAATTGC TAATTTCCTA CATAGAGATA AAATTAATGA GCACGGCTAT 2 04 0 

CGCCATAGCG GTGTCGGTTA TCTTGTGGGA GTTGGCACTC ATGCTTTTTC TGATGCTACG 2100 

ATAAATGCGG CTTTTTGCCA GCTCTTCAGT AGAGATAAAG ACTACGTAGT ATCCAAAAAT 2160 

CATGGAACTA GCTACTCAGG GGTCGTATTT CTTGAGGATA CCCTAGAGTT TAGAAGTCCA 222 0 

CAGGGATTCT ATACTGATAG CTCCTCAGAA GCTTGCTGTA ACCAAGTCGT CACTATAGAT 22 8 0 

ATGCAGTTGT CTTACAGCCA TAGAAATAAT GATATGAAAA CCAAATACAC GACATATCCA 234 0 

GAAGCTCAGG GATCTTGGGC AAATGATGTT TTTGGTCTTG AGTTTGGAGC GACTACATAC 24 00 

TACTACCCTA ACAGTACTTT TTTATTTGAT TACTACTCTC CGTTTCTCAG GCTGCAGTGC 2 4 60 

ACCTATGCTC ACCAGGAAGA CTTCAAAGAG ACAGGAGGTG AGGTTCGTCA CTTTACTAGC 252 0 
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GGAGATCTTT TCAATTTAGC AGTTCCTATT GGCGTGAAGT TTGAGAGATT TTCAGACTGT 258 0 

AAAAGGGGAT CTTATGAACT TACCCTTGCT TATGTTCCTG ATGTGATTCG CAAAGATCCC 2 64 0 

AAGAGCACGG CAACATTGGC TAGTGGAGCT ACGTGGAGCA CCCACGGAAA CAATCTCTCC 27 0 0 

AGACAAGGAT TACAACTGCG TTTAGGGAAC CACTGTCTCA TAAATCCTGG AATTGAGGTG 2760 

TTCAGTCACG GAGCTATTGA ATTGCGGGGA TCCTCTCGTA ATTATAACAT CAATCTCGGG 2 820 

GGTAAATACC GATTTTAA 2 83 8 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Lys Thr Ser Val Ser Met Leu Leu Ala Leu Leu Cys Ser Gly Ala 
15 10 15 

Ser Ser lie Val Leu His Ala Ala Thr Th"r Pro "Leu Asn Pro Glu Asp 
20 25 30 

Gly Phe lie Gly Glu Gly Asn Thr Asn Thr Phe Ser Pro Lys Ser Thr 
35 40 45 

Thr Asp Ala Ala Gly Thr Thr Tyr Ser Leu Thr Gly Glu Val Leu Phe 
50 55 60 

lie Asp Pro Gly Lys Gly Gly Ser lie Thr Gly Thr Cys Phe Val Glu 
65 70 75 80 

Thr Ala Gly Asp Leu Thr Phe Leu Gly Asn Gly Asn Thr Leu Lys Phe 
85 90 95 

Leu Ser Val Asp Ala Gly Ala Asn lie Ala Val Ala His Val Gin Gly 
100 105 110 

Ser Lys Asn Leu Ser Phe Thr Asp Phe Leu Ser Leu Val lie Thr Glu 
115 120 125 

Ser Pro Lys Ser Ala Val Ser Thr Gly Lys Gly Ser Leu Val Ser Ser 
130 135 140 

Gly Ala Val Gin Leu Gin Asp lie Asn Thr Leu Val Leu Thr Ser Asn 
145 150 155 160 

Ala Ser Val Glu Asp Gly Gly Val lie Lys Gly Asn Ser Cys Leu lie 
165 170 175 

Gin Gly lie Lys Asn Ser Ala lie Phe Gly Gin Asn Thr Ser Ser Lys 
180 185 190 



Lys Gly Gly Ala lie Ser Thr Thr Gin Gly Leu Thr lie Glu Asn Asn 
195 200 205 
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Leu Gly Thr Leu Lys Phe Asn Glu Asn Lys Ala Val Thr Ser Gly Gly 
210 215 220 

Ala Leu Asp Leu Gly Ala Ala Ser Thr Phe Thr Ala Asn His Glu Leu 
225 230 235 240 

lie Phe Ser Gin Asn Lys Thr Ser Gly Asn Ala Ala Asn Gly Gly Ala 
245 250 255 

lie Asn Cys Ser Gly Asp Leu Thr Phe Thr Asp Asn Thr Ser Leu Leu 
260 265 270 

Leu Gin Glu Asn Ser Thr Met Gin Asp Gly Gly Ala Leu Cys Ser Thr 
275 280 285 

Gly Thr lie Ser lie Thr Gly Ser Asp Ser lie Asn Val lie Gly Asn 
290 295 300 

Thr Ser Gly Gin Lys Gly Gly Ala lie Ser Ala Ala Ser Leu Lys lie 
305 310 315 320 

Leu Gly Gly Gin Gly Gly Ala Leu Phe Ser Asn Asn Val Val Thr His 
325 330 • 335 

Ala Thr Pro Leu Gly Gly Ala lie Phe lie Asn Thr Gly Gly Ser Leu 
340 345 350 

Gin Leu Phe Thr Gin Gly Gly Asp lie Val Phe Glu Gly Asn Gin Val 
355 360 365 

Thr Thr Thr Ala Pro Asn Ala Thr Thr Lys Arg Asn Val lie His Leu 
370 375 380 

Glu Ser Thr Ala Lys Trp Thr Gly Leu Ala Ala Ser Gin Gly Asn Ala 
385 390 395 400 

lie Tyr Phe Tyr Asp Pro lie Thr Thr Asn Asp Thr Gly Ala Ser Asp 
405 410 415 

Asn Leu Arg lie Asn Glu Val Ser Ala Asn Gin Lys Leu Ser Gly Ser 
420 425 430 

lie Val Phe Ser Gly Glu Arg Leu Ser Thr Ala Glu Ala lie Ala Glu 
435 440 445 

Asn Leu Thr Ser Arg lie Asn Gin Pro Val Thr Leu Val Glu Gly Ser 
450 455 460 

Leu Glu Leu Lys Gin Gly Val Thr Leu lie Thr Gin Gly Phe Ser Gin 
465 470 475 480 

Glu Pro Glu Ser Thr Leu Leu Leu Asp Leu Gly Thr Ser Leu Gin Ala 
485 490 495 

Ser Thr Glu Asp lie Val lie Thr Asn Ser Ser lie Asn Ala Asp Thr 
500 505 510 
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lie Tyr Gly Lys Asn Pro lie Asn lie Val Ala Ser Ala Ala Asn Lys 
515 520 525 

Asn lie Thr Leu Thr Gly Thr Leu Ala Leu Val Asn Ala Asp Gly Ala 
530 535 540 

Leu Tyr Glu Asn His Thr Leu Gin Asp Ser Gin Asp Tyr Ser Phe Val 
545 550 555 560 

Lys Leu Ser Pro Gly Ala Gly Gly Thr lie lie Thr Gin Asp Ala Ser 
565 570 575 

Gin Lys Leu Leu Glu Val Ala Pro Ser Arg Pro His Tyr Gly Tyr Gin 
580 585 590 

Gly His Trp Asn Val Gin Val lie Pro Gly Thr Gly Thr Gin Pro Ser 
595 600 605 

Gin Ala Asn Leu Glu Trp Val Arg Thr Gly Tyr Leu Pro Asn Pro Glu 
610 615 620 

Arg Gin Gly Phe Leu Val Pro Asn .Ser Leu Trp Gly Ser Phe Val Asp 
625 630 635 640 

Gin Arg Ala lie Gin Glu lie Met Val Asn Ser Ser Gin lie Leu Cys 
645 650 655 

Gin Glu Arg Gly Val Trp Gly Ala Gly lie Ala Asn Phe Leu His Arg 
660 665 670 

Asp Lys lie Asn Glu His Gly Tyr Arg His Ser Gly Val Gly Tyr Leu 
675 680 685 

Val Gly Val Gly Thr His Ala Phe Ser Asp Ala Thr lie Asn Ala Ala 
690 695 700 

Phe Cys Gin Leu Phe Ser Arg Asp Lys Asp Tyr Val Val Ser Lys Asn 
705 710 715 720 

His Gly Thr Ser Tyr Ser Gly Val Val Phe Leu Glu Asp Thr Leu Glu 
725 730 735 

Phe Arg Ser Pro Gin Gly Phe Tyr Thr Asp Ser Ser Ser Glu Ala Cys 
740 745 750 

Cys Asn Gin Val Val Thr lie Asp Met Gin Leu Ser Tyr Ser His Arg 
755 760 765 

Asn Asn Asp Met Lys Thr Lys Tyr Thr Thr Tyr Pro Glu Ala Gin Gly 
770 775 780 

Ser Trp Ala Asn Asp Val Phe Gly Leu Glu Phe Gly Ala Thr Thr Tyr 
785 790 795 800 

Tyr Tyr Pro Asn Ser Thr Phe Leu Phe Asp Tyr Tyr Ser Pro Phe Leu 
805 810 815 
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Arg Leu Gin Cys Thr Tyr Ala His Gin Glu Asp Phe Lys Glu Thr Gly 
820 825 830 

Gly Glu Val Arg His Phe Thr Ser Gly Asp Leu Phe Asn Leu Ala Val 
835 840 845 

Pro lie Gly Val Lys Phe Glu Arg Phe Ser Asp Cys Lys Arg Gly Ser 
850 855 860 

Tyr Glu Leu Thr Leu Ala Tyr Val Pro Asp Val lie Arg Lys Asp Pro 
865 870 875 880 

Lys Ser Thr Ala Thr Leu Ala Ser Gly Ala Thr Trp Ser Thr His Gly 
885 890 895 

Asn Asn Leu Ser Arg Gin Gly Leu Gin Leu Arg Leu Gly Asn His Cys 
900 905 910 

Leu lie Asn Pro Gly lie Glu Val Phe Ser His Gly Ala lie Glu Leu 
915 920 925 



Arg Gly Ser Ser Arg Asn Tyr Asn lie Asn Leu Gly Gly Lys Tyr Arg 
930 935 940 

Phe 
945 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 259... 3000 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATCAGGTGAT AAAAGTTCCT CGTTAGCTAG TGACTGTAGG TGACATGAGA AAGCTAACAC 60 

GGAGGAAACT AAAACCCAAG GAATCGAAGT CTTCATGGTA ATGCTTTTGT TTTTTAGAGA 120 

ACTATTCGCA TCAATATAGA AACAAAATAA GTAAATCAAG TTAAAGATGA CAAAACAGCT 180 

GTCAAGAATT TTTATCTTGA CTCTCTGAGT TTTCTATTTT ATATGACGCA AGTT^AGAATT 24 0 

TAATAATAAA GTGGGTTT ATG AAA TCG CAA TTT TCC TGG TTA GTG CTC TCT 291 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser 
15 10 



TCG ACA TTG GCA TGT TTT ACT AGT TGT TCC ACT GTT TTT GCT GCA ACT 
Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser Thr Val Phe Ala Ala Thr 
15 20 25 



339 
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GCT GAA AAT ATA GGC CCC TCT GAT AGC TTT GAC GGA AGT ACT AAC ACA 3 87 

Ala Glu Asn lie Gly Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr 
30 35 40 

GGC ACC TAT ACT CCT AAA AAT ACG ACT ACT GGA ATA GAC TAT ACT CTG 4 35 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu 
45 50 55 

ACA GGA GAT ATA ACT CTG CAA AAC CTT GGG GAT TCG GCA GCT TTA ACG 4 83 

Thr Gly Asp lie Thr Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr 

60 65 70 75 

AAG GGT TGT TTT TCT GAC ACT ACG GAA TCT TTA AGC TTT GCC GGT AAG 531 

Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys 
80 85 90 

GGG TAC TCA CTT TCT TTT TTA AAT ATT AAG TCT AGT GCT GAA GGC GCA 57 9 

Gly Tyr Ser Leu Ser Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala 
95 100 105 

GCA CTT TCT GTT ACA ACT GAT AAA AAT CTG TCG CTA ACA GGA TTT TCG 62 7 

Ala Leu Ser Val Thr Thr "Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser 
110 115 120 

AGT CTT ACT TTC TTA GCG GCC CCA TCA TCG GTA ATC ACA ACC CCC TCA 67 5 

Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser 
125 130 135 

GGA AAA GGT GCA GTT AAA TGT GGA GGG GAT CTT ACA TTT GAT AAC AAT 72 3 

Gly Lys Gly Ala Val Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn 

140 145 150 155 

GGA ACT ATT TTA TTT AAA CAA GAT TAC TGT GAG GAA AAT GGC GGA GCC 771 

Gly Thr lie Leu Phe Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala 
160 165 170 

ATT TCT ACC AAG AAT CTT TCT TTG AAA AAC AGC ACG GGA TCG ATT TCT 819 

lie Ser Thr Lys Asn Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser 
175 180 185 

TTT GAA GGG AAT AAA TCG AGC GCA ACA GGG AAA AAA GGT GGG GCT ATT 8 67 

Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie 
190 195 200 

TGT GCT ACT GGT ACT GTA GAT ATT ACA AAT AAT ACG GCT CCT ACC CTC 915 

Cys Ala Thr Gly Thr Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu 
205 210 215 

TTC TCG AAC AAT ATT GCT GAA GCT GCA GGT GGA GCT ATA AAT AGC ACA 9 63 

Phe Ser Asn Asn lie Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr 

220 225 230 235 

GGA AAC TGT ACA ATT ACA GGG AAT ACG TCT CTT GTA TTT TCT GAA AAT 1011 

Gly Asn Cys Thr lie Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn 
240 245 250 



85 



AGT GTG ACA 
Ser Val Thr 



GAT GTT ACC 
Asp Val Thr 
270 

GCT GTA GCT 
Ala Val Ala 
285 

TCC GGG GGG 
Ser Gly Gly 
300 

ACC ACT GCA 
Thr Thr Ala 



TGT AGT CTT 
Cys Ser Leu 



ATT GTT GCA 
lie Val Ala 
350 

GGA TCT ACT 
Gly Ser Thr 
365 

ATC TTT TTC 
lie Phe Phe 
380 

GAT ACT TTA 
Asp Thr Leu 



AGT GGG TCG 
Ser Gly Ser 



AAA GTT GCA 
Lys Val Ala 
430 

ACT GCA GGA 
Thr Ala Gly 
445 

GGC TTT ACT 
Gly Phe Thr 
460 



GCG ACC GCA 
Ala Thr Ala 
255 

ATA TCT GGG 
lie Ser Gly 



AAT GGC GGA 
Asn Gly Gly 



GGG GGG GGT 
Gly Gly Gly 
305 

GGT AAT GGT 
Gly Asn Gly 
320 

TCA GCA GAA 
Ser Ala Glu 
335 

ACT ACA CCA 
Thr Thr Pro 



GCA AAG ATC 
Ala Lys lie 

TAC GAT CCG 
Tyr Asp Pro 
385 

AAT CTC AAT 
Asn Leu Asn 
400 

ATT GTT TTT 
lie Val Phe 
415 

GAC AAC CTC 
Asp Asn Leu 

AAT TTA GTA 
Asn Leu Val 



CAG ACC GCG 
Gin Thr Ala 
465 



GGA AAT GGA 
Gly Asn Gly 
260 

AAT CAG AGT 
Asn Gin Ser 
275 

GCC ATT TAT 
Ala lie Tyr 
290 

ATC TCC TTT 
lie Ser Phe 



GGA GCC ATT 
Gly Ala lie 



GCA GGG GAC 
Ala Gly Asp 
340 

CAA ACT ACA 
Gin Thr Thr 
355 

ACG AAT TTA 
Thr Asn Leu 
370 

ATT ACT GCT 
lie Thr Ala 



AAG GCT GAT 
Lys Ala Asp 



TCT GGT GAA 
Ser Gly Glu 
420 

ACT TCT ACG 
Thr Ser Thr 
435 

CTT AAA CGT 
Leu Lys Arg 
450 

GGT TCC TCT 
Gly Ser Ser 



GGA GCT CTT 
Gly Ala Leu 



GTA ACT TTC 
Val Thr Phe 



GCT AAG AAG 
Ala Lys Lys 
295 

TCT AAC AAT 
Ser Asn Asn 
310 

TCT ATA CTG 
Ser lie Leu 
325 

ATT ACC TTC 
lie Thr Phe 



AAA AGA AAT 
Lys Arg Asn 



CGT GCA ATA 
Arg Ala lie 
375 

AAT ACG GCT 
Asn Thr Ala 
390 

GCA GGT AAT 
Ala Gly Asn 
405 

AAG CTC TCT 
Lys Leu Ser 



CTG AAG CAG 
Leu Lys Gin 



GGT GTC ACT 
Gly Val Thr 
455 

GTT ATT ATG 
Val lie Met 
470 



TCT GGA GAT 
Ser Gly Asp 
265 

TCA GGA AAC 
Ser Gly Asn 
280 

CTT ACA CTG 
Leu Thr Leu 



ATA GTC CAA 
lie Val Gin 



GCA GCT GGA 
Ala Ala Gly 
330 

AAT GGG AAT 
Asn Gly Asn 
345 

TCT ATT GAC 
Ser lie Asp 
360 

TCT GGG CAT 
Ser Gly His 



GCG GAT TCT 
Ala Asp Ser 



AGT ACA GAT 
Ser Thr Asp 
410 

GAA GAT GAA 
Glu Asp Glu 
425 

CCT GTA ACT 
Pro Val Thr 
440 

CTC GAT ACG 
Leu Asp Thr 



GAT GCG GGC 
Asp Ala Gly 



GCC 1059 
Ala 



CAA 1107 
Gin 



GCT 1155 
Ala 



GGT 12 03 

Gly 

315 

GAG 12 51 

Glu 



GCC 1299 
Ala 



ATA 1347 
He 



AGC 1395 
Ser 



ACA 1443 

Thr 

395 

TAT 1491 
Tyr 



GCA 153 9 

Ala 



CTA 1587 
Leu 



AAA 1635 
Lys 

ACA 168 3 

Thr 

475 
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ACG TTA AAA 
Thr Leu Lys 



CCT GTA GAC 
Pro Val Asp 



GCA GCA AGT 
Ala Ala Ser 
510 

AAC CAA GGG 
Asn Gin Gly 
525 

TTT TCA TTT 
Phe Ser Phe 
540 

GTT CCA GCG 
Val Pro Ala 



GGT ACT TGG 
Gly Thr Trp 



ACT AAG ACA 
Thr Lys Thr 
590 

CCT GAG CGT 
Pro Glu Arg 
605 

TCA GAC ATC 
Ser Asp lie 
620 

CTT TGT TCA 
Leu Cys Ser 



GAT AAA GAT 
Asp Lys Asp 



GGA TAT GCT 
Gly Tyr Ala 
670 

AGC TTT GCC 
Ser Phe Ala 
685 



GCA AGT ACA 
Ala Ser Thr 
480 

TCT TTA GGC 
Ser Leu Gly 
495 

AAA AAT GTA 
Lys Asn Val 



AAT GCT TAT 
Asn Ala Tyr 



GTG CAG CTC 
Val Gin Leu 
545 

GTT CCT ACA 
Val Pro Thr 
560 

GGA ATG ACT 
Gly Met Thr 
575 

GCG ACA TTA 
Ala Thr Leu 



CAA GGA CCT 
Gin Gly Pro 



CAA GCG ATT 
Gin Ala lie 
625 

GAT CGA GGC 
Asp Arg Gly 
640 

AAG AAA GGG 
Lys Lys Gly 
655 

ATC GGA GGT 
lie Gly Gly 



TTT TGC CAA 
Phe Cys Gin 



GAG GAG GTC 
Glu Glu Val 



GAG GGT AAG 
Glu Gly Lys 
500 

GCC CTT AGT 
Ala Leu Ser 
515 

GAA AAT CAC 
Glu Asn His 
530 

TCT GCT CTG 
Ser Ala Leu 



GTA GCA ACT 
Val Ala Thr 



TGG GTT GAT 
Trp Val Asp 
580 

GCT TGG ACC 
Ala Trp Thr 
595 

TTA GTT CCT 

Leu Val Pro 
610 

CAA GGT GTC 

Gin Gly Val 



TTC TGG GCT 
Phe Trp Ala 



GAA AAA CGC 
Glu Lys Arg 
660 

GCA GCG CAA 
Ala Ala Gin 
675 

CTC TTT GGT 
Leu Phe Gly 
690 



ACT TTA ACA 
Thr Leu Thr 
485 

AAA GTT GTA 
Lys Val Val 



GGT CCG ATT 
Gly Pro lie 



GAC TTA GGA 
Asp Leu Gly 
535 

GGT ACT GCA 
Gly Thr Ala 
550 

CCT ACG CAC 
Pro Thr His 
565 

GAT ACC GCA 
Asp Thr Ala 



AAT ACA GGC 
Asn Thr Gly 



AAT AGC CTT 
Asn Ser Leu 
615 

ATA GAG AGA 
lie Glu Arg 
630 

GCG GGA GTC 
Ala Gly Val 
645 

AAA TAC CGT 
Lys Tyr Arg 



ACT TGT TCT 
Thr Cys Ser 

AGC GAT AAA 
Ser Asp Lys 
695 



GGT CTT TCC 
Gly Leu Ser 
490 

ATT GCT GCT 
lie Ala Ala 
505 

CTT CTT TTG 

Leu Leu Leu 
520 

AAA ACT CAA 

Lys Thr Gin 



ACA ACT ACA 
Thr Thr Thr 



TAT GGG TAT 
Tyr Gly Tyr 
570 

AGC ACT CCA 
Ser Thr Pro 
585 

TAC CTT CCG 

Tyr Leu Pro 
600 

TGG GGA TCT 

Trp Gly Ser 



AGT GCT TTG 
Ser Ala Leu 



GCC AAT TTC 
Ala Asn Phe 
650 

CAT AAA TCT 
His Lys Ser 
665 

GAA AAC TTA 
Glu Asn Leu 
680 

GAT TTC TTA 
Asp Phe Leu 



ATT 1731 
He 



TCT 1779 
Ser 



GAT 1827 
Asp 



GAC 1875 
Asp 



GAT 1923 

Asp 

555 

CAA 1971 
Gin 



AAG 2019 
Lys 



AAT 2 0 67 

Asn 



TTT 2115 
Phe 



ACT 2163 

Thr 

635 

TTA 2211 
Leu 



GGT 2259 
Gly 



ATT 2307 
He 



GTC 2355 
Val 
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GCT AAA AAT 
Ala Lys Asn 
700 

ATT ACA GAA 
lie Thr Glu 



GGC TCT TGG 
Gly Ser Trp 



AGC CAC GTC 
Ser His Val 
750 

GTG AAA GGT 
Val Lys Gly 
765 

TCT TCT CAT 

Ser Ser His 
780 

CCA TAC ATC 
Pro Tyr lie 



GAG AAA GGT 
Glu Lys Gly 



TTA TCT TTG 

Leu Ser Leu 
830 

GAC TTT TCT 

Asp Phe Ser 
845 

AAT GAT CCC 

Asn Asp Pro 
860 

GAA ACT TAT 

Glu Thr Tyr 



GGC AGT CAC 
Gly Ser His 



GTC TTT GAA 
Val Phe Glu 
910 



CAT ACT GAT 
His Thr Asp 
705 

TGT AGT GGG 
Cys Ser Gly 
720 

AGT CAT AAA 
Ser His Lys 
735 

AGT AAT GAT 
Ser Asn Asp 



TCT TGG GGG 
Ser Trp Gly 



TCT TAT CCT 
Ser Tyr Pro 
785 

AAA CTG AAT 
Lys Leu Asn 
800 

ACA GAA GGA 
Thr Glu Gly 
815 

CCT ATA GGG 
Pro lie Gly 



TAT GAT CTG 
Tyr Asp Leu 



AAA TGC ACT 
Lys Cys Thr 
865 

GCC AAT AAC 
Ala Asn Asn 
880 

TAC GCC TTC 
Tyr Ala Phe 
895 

GTT CGT GGA 
Val Arg Gly 



ACC TAT GCA 
Thr Tyr Ala 



TTC ATA GGT 
Phe lie Gly 



CCC CTC GTT 
Pro Leu Val 
740 

CTG AAG ACA 
Leu Lys Thr 
755 

AAT AAT GCT 
Asn Asn Ala 
770 

GAA TAC CTG 
Glu Tyr Leu 



CTG ACC TAT 
Leu Thr Tyr 



AGA TCT TTT 

Arg Ser Phe 
82 0 

GTG AAG TTT 

Val Lys Phe 
835 

ACT TTA TCC 

Thr Leu Ser 
850 

ACA GCA CTT 

Thr Ala Leu 



TTA GCA CGA 
Leu Ala Arg 



TCT CCT ATG 
Ser Pro Met 
900 

TCC 
Ser 



GGA GCC TTC 
Gly Ala Phe 
710 

TGT CTC TTA 
Cys Leu Leu 
725 

TTA GAA GGG 
Leu Glu Gly 



AAG TAT ACT 
Lys Tyr Thr 



TTT AAC ATG 
Phe Asn Met 
775 

CAT TGT TTT 
His Cys Phe 
790 

ATA CGT CAG 
lie Arg Gin 
805 

GAT GAC AGC 
Asp Asp Ser 



GAG AAG TTC 
Glu Lys Phe 



TAT GTT CCT 
Tyr Val Pro 
855 

GTA ATC AGC 
Val lie Ser 
870 

CAG GCC TTG 
Gin Ala Leu 
885 

TTT GAA GTG 
Phe Glu Val 



TAT ATC CAA 

Tyr lie Gin 

GAT AAA CTT 

Asp Lys Leu 
730 

CAG CTC GCT 

Gin Leu Ala 
745 

GCG TAT CCT 

Ala Tyr Pro 
760 

ATG TTG GGA 

Met Leu Gly 



GAT ACC TAT 
Asp Thr Tyr 



GAC AGC TTC 
Asp Ser Phe 
810 

AAC CTC TTC 
Asn Leu Phe 
825 

TCT GAT TGT 
Ser Asp Cys 
840 

GAT CTT ATC 
Asp Leu lie 



GGA GCC TCT 
Gly Ala Ser 



CAA GTG CGT 
Gin Val Arg 
890 

CTC GGC CAG 
Leu Gly Gin 
905 



CAC 24 03 

His 

715 

CCT 2451 
Pro 



TAT 2499 
Tyr 



GAG 2547 
Glu 



GCT 2595 
Ala 



GCT 2643 

Ala 

795 

TCG 2691 
Ser 



AAT 2739 
Asn 



AAT 2787 
Asn 



CGC 2835 
Arg 



TGG 2883 

Trp 

875 

GCA 2931 
Ala 



T^TT 2 97 9 

Phe 



3000 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 
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(i) SEQUENCE CHARA.CTERI STICS : 

(A) LENGTH: 914 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Ser Gin Phe Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys 
15 10 15 

Phe Thr -Ser Cys Ser Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly 
20 25 30 

Pro Ser Asp Ser Phe Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro 
35 40 45 

Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu Thr Gly Asp lie 'Thr 
50 55 60 

Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser 
65 70 75 80 

Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser 
85 90 95 

Phe Leu Asn lie Lys Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr 
100 105 110 

Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu 
115 120 125 

Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser Gly Lys Gly Ala Val 
130 135 140 

Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn Gly Thr lie Leu Phe 
145 150 155 160 

Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala lie Ser Thr Lys Asn 
165 170 175 

Leu Ser Leu Lys Asn Ser Thr Gly Ser lie Ser Phe Glu Gly Asn Lys 
180 185 190 

Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala lie Cys Ala Thr Gly Thr 
195 200 205 

Val Asp lie Thr Asn Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn lie 
210 215 220 

Ala Glu Ala Ala Gly Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie 
225 230 235 240 
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Thr Gly Asn Thr 



Ala Gly Asn Gly 
260 

Gly Asn Gin Ser 
275 

Gly Ala lie Tyr 
290 

Gly lie Ser Phe 
305 

Gly Gly Ala lie 



Glu Ala Gly Asp 
340 

Pro Gin Thr Thr 
355 

lie Thr Asn Leu 
370 

Pro lie Thr Ala 

385 

Asn Lys Ala Asp 



Phe Ser Gly Glu 
420 

Leu Thr Ser Thr 
435 

Val Leu Lys Arg 
450 

Ala Gly Ser Ser 
465 

Thr Glu Glu Val 



Gly Glu Gly Lys 
500 

Val Ala Leu Ser 
515 

Tyr Glu Asn His 
530 

Leu Ser Ala Leu 
545 



Ser Leu Val Phe 
245 

Gly Ala Leu Ser 



Val Thr Phe Ser 
280 

Ala Lys Lys Leu 
295 

Ser Asn Asn lie 
310 

Ser lie Leu Ala 
325 

lie Thr Phe Asn 



Lys Arg Asn Ser 
360 

Arg Ala lie Ser 
375 

Asn Thr Ala Ala 
390 

Ala Gly Asn Ser 
405 

Lys Leu Ser Glu 

Leu Lys Gin Pro 
440 

Gly Val Thr Leu 
455 

Val lie Met Asp 
470 

Thr Leu Thr Gly 
485 

Lys Val Val lie 



Gly Pro lie Leu 
520 

Asp Leu Gly Lys 
535 

Gly Thr Ala Thr 
550 



Ser Glu Asn Ser 
250 

Gly Asp Ala Asp 
265 

Gly Asn Gin Ala 



Thr Leu Ala Ser 
300 

Val Gin Gly Thr 
315 

Ala Gly Glu Cys 
330 

Gly Asn Ala lie 
345 

lie Asp lie Gly 



Gly His Ser lie 
380 

Asp Ser Thr Asp 
395 

Thr Asp Tyr Ser 
410 

Asp Glu Ala Lys 
425 

Val Thr Leu Thr 



Asp Thr Lys Gly 
460 

Ala Gly Thr Thr 
475 

Leu Ser lie Pro 
490 

Ala Ala Ser Ala 
505 

Leu Leu Asp Asn 



Thr Gin Asp Phe 
540 

Thr Thr Asp Val 
555 



Val Thr Ala Thr 
255 

Val Thr lie Ser 
270 

Val Ala Asn Gly 
285 

Gly Gly Gly Gly 



Thr Ala Gly Asn 
320 

Ser Leu Ser Ala 
335 

Val Ala Thr Thr 
350 

Ser Thr Ala Lys 

365 " 

Phe Phe Tyr Asp 



Thr Leu Asn Leu 
400 

Gly Ser lie Val 
415 

Val Ala Asp Asn 
430 

Ala Gly Asn Leu 
445 

Phe Thr Gin Thr 



Leu Lys Ala Ser 
480 

Val Asp Ser Leu 
495 

Ala Ser Lys Asn 
510 

Gin Gly Asn Ala 
525 

Ser Phe Val Gin 



Pro Ala Val Pro 
560 
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Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met 
565 570 575 

Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr 
580 585 590 

Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly 
595 600 605 

Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe Ser Asp lie Gin Ala 
610 615 620 

lie Gin Gly Val lie . Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg 
625 630 635 640 

Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys 
645 650 655 

Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly 
660 665 670 

Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu lie 'Ser Phe Ala Phe Cys 
675 680 685 

Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr 
690 695 700 

Asp Thr Tyr Ala Gly Ala Phe Tyr lie Gin His lie Thr Glu Cys Ser 
705 710 715 720 

Gly Phe lie Gly Cys Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His 
725 730 735 

Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn 
740 745 750 

Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp 
755 760 765 

Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr 
770 775 780 

Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala Pro Tyr lie Lys Leu 
785 790 795 800 

Asn Leu Thr Tyr lie Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu 
805 810 815 

Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro lie 
820 825 830 

Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp 
835 840 845 

Leu Thr Leu Ser Tyr Val Pro Asp Leu lie Arg Asn Asp Pro Lys Cys 
850 855 860 
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Thr Thr TVla Leu Val lie Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn 
865 870 875 880 

Asn Leu Ala Arg Gin Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala 
885 890 895 

Phe Ser Pro Met Phe Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg 
900 905 910 

Gly Ser 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1200 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GAT CCT AAA AAT AAA GAG TAC ACA GGG ACC ATA CTC TTT TCT GGA GAA 4 8 

Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu 
15 10 15 

AAG AGT CTA GCA AAC GAT CCT AGG GAT TTT AAA TCT ACA ATC CCT CAG 96 
Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr lie Pro Gin 
20 25 30 

AAC GTC T^AC CTG TCT GCA GGA TAC TTA GTT ATT AAA GAG GGG GCC GAA 14 4 

Asn Val Asn Leu Ser Ala Gly Tyr Leu Val lie Lys Glu Gly Ala Glu 
35 40 45 

GTC ACA GTT TCA AAA TTC ACG CAG TCT CCA GGA TCG CAT TTA GTT TTA 192 
Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu 
50 55 60 

GAT TTA GGA ACC AAA CTG ATA GCC TCT AAG GAA GAC ATT GCC ATC ACA 24 0 

Asp Leu Gly Thr Lys Leu lie Ala Ser Lys Glu Asp lie Ala lie Thr 
65 70 75 80 

GGC CTC GCG ATA GAT ATA GAT AGC TTA AGC TCA TCC TCA ACA GCA GCT 2 88 

Gly Leu Ala lie Asp lie Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 
85 90 95 

GTT ATT AAA GCA AAC ACC GCA AAT AAA CAG ATA TCC GTG ACG GAC TCT 336 
Val lie Lys Ala Asn Thr Ala Asn Lys Gin lie Ser Val Thr Asp Ser 
100 105 110 
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ATA GAA CTT ATC TCG CCT ACT GGC AAT GCC TAT GAA GAT CTC AGA ATG 384 

lie Glu Leu lie Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 
115 120 125 

AGA AAT TCA CAG ACG TTC CCT CTG CTC TCT TTA GAG CCT GGA GCC GGG 4 32 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 
130 135 140 

GGT AGT GTG ACT GTA ACT GCT GGA GAT TTC CTA CCG GTA AGT CCC CAT 4 80 

Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 
145 150 155 160 

TAT GGT TTT CAA GGC AAT TGG AAA TTA GCT TGG ACA GGA ACT GGA AAC 52 8 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 
165 170 175 

AAA GTT GGA GAA TTC TTC TGG GAT AAA ATA AAT TAT AAG CCT AGA CCT 57 6 

Lys Val Gly Glu Phe Phe Trp Asp Lys lie Asn Tyr Lys Pro Arg Pro 

180 185 190 

GAA AAA GAA GGA AAT TTA GTT CCT AAT ATC TTG TGG GGG AAT GCT GTA 624 

Glu Lys Glu Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val 
195 200' 205 

AAT GTC AGA TCC TTA ATG CAG GTT CAA GAG ACC CAT GCA TCG AGC TTA 672 

Asn Val Arg Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu 
210 215 220 

CAG ACA GAT CGA GGG CTG TGG ATC GAT GGA ATT GGG AAT TTC TTC CAT 72 0 

Gin Thr Asp Arg Gly Leu Trp lie Asp Gly lie Gly Asn Phe Phe His 
225 230 235 240 

GTA TCT GCC TCC GAA GAC AAT ATA AGG TAC CGT CAT AAC AGC GGT GGA 7 68 

Val Ser Ala Ser Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly 
245 250 255 

TAT GTT CTA TCT GTA AAT AAT GAG ATC ACA CCT AAG CAC TAT ACT TCG 816 

Tyr Val Leu Ser Val Asn Asn Glu lie Thr Pro Lys His Tyr Thr Ser 

260 265 270 

ATG GCA TTT TCC CAA CTC TTT AGT AGA GAC AAA GAC TAT GCG GTT TCC 8 64 

Met Ala Phe Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser 
275 280 285 

AAC AAC GAA TAC AGA ATG TAT TTA GGA TCG TAT CTC TAT CAA TAT ACA 912 

Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr 
290 295 300 

ACC TCC CTA GGG AAT ATT TTC CGT TAT GCT TCG CGT AAC CCT 7VAT GTA 960 

Thr Ser Leu Gly Asn lie Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val 
305 310 315 320 

AAC GTC GGG ATT CTC TCA AGA AGG TTT CTT CAA AAT CCT CTT ATG ATT 1008 

Asn Val Gly lie Leu Ser Arg Arg Phe Leu Gin Asn Pro Leu Met lie 
325 330 335 



93 



TTT CAT TTT TTG TGT GCT TAT GOT CAT GCC ACC AAT GAT ATG AAA ACA 1056 

Phe His Phe Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr 
340 345 350 

GAG TAC GCA AAT TTC CCT ATG GTG AAA AAC AGO TGG AGA AAC AAT TGT 1104 

Asp Tyr Ala Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys 
355 360 365 

TGG GCT ATA AAA TGC GGA GGG AGC ATG CCT CTA TTG GTA TTT GAA AAC 1152 

Trp Ala lie Lys Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn 

370 375 380 

GGA AAA CTT TTC CAA GGT GCC ATC CCA TTT ATG AAA CTA CAA TTA GTT 12 00 

Gly Lys Leu Phe Gin Gly Ala lie Pro Phe Met Lys Leu Gin Leu Val 

385 390 395 400 



(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Asp Pro Lys Asn Lys Glu Tyr Thr Gly Thr lie Leu Phe Ser Gly Glu 
15 10 15 

Lys Ser Leu Ala Asn Asp Pro Arg Asp Phe Lys Ser Thr lie Pro Gin 
20 25 30 

Asn Val Asn Leu Ser Ala Gly Tyr Leu Val lie Lys Glu Gly Ala Glu 
35 40 45 

Val Thr Val Ser Lys Phe Thr Gin Ser Pro Gly Ser His Leu Val Leu 
50 55 60 

Asp Leu Gly Thr Lys Leu lie Ala Ser Lys Glu Asp lie Ala lie Thr 
65 70 75 80 

Gly Leu Ala lie Asp lie Asp Ser Leu Ser Ser Ser Ser Thr Ala Ala 
85 90 95 

Val lie Lys Ala Asn Thr Ala Asn Lys Gin lie Ser Val Thr Asp Ser 
100 105 110 

lie Glu Leu lie Ser Pro Thr Gly Asn Ala Tyr Glu Asp Leu Arg Met 
115 120 125 

Arg Asn Ser Gin Thr Phe Pro Leu Leu Ser Leu Glu Pro Gly Ala Gly 
130 135 140 
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Gly Ser Val Thr Val Thr Ala Gly Asp Phe Leu Pro Val Ser Pro His 
145 150 155 160 

Tyr Gly Phe Gin Gly Asn Trp Lys Leu Ala Trp Thr Gly Thr Gly Asn 
165 170 175 

Lys Val Gly Glu Phe Phe Trp Asp Lys lie Asn Tyr Lys Pro Arg Pro 
180 185 190 

Glu Lys Glu Gly Asn Leu Val Pro Asn lie Leu Trp Gly Asn Ala Val 
195 200 205 

Asn Val Arg Ser Leu Met Gin Val Gin Glu Thr His Ala Ser Ser Leu 
210 215 220 

Gin Thr Asp Arg Gly Leu Trp lie Asp Gly lie Gly Asn Phe Phe His 
225 230 235 240 

Val Ser Ala Ser Glu Asp Asn lie Arg Tyr Arg His Asn Ser Gly Gly 
245 250 255 

Tyr Val Leu Ser Val Asn. Asn Glu lie Thr Pro Lys His Tyr Thr Ser 
260 ' 265 270 

Met Ala Phe Ser Gin Leu Phe Ser Arg Asp Lys Asp Tyr Ala Val Ser 
275 280 285 

Asn Asn Glu Tyr Arg Met Tyr Leu Gly Ser Tyr Leu Tyr Gin Tyr Thr 
290 295 300 

Thr Ser Leu Gly Asn lie Phe Arg Tyr Ala Ser Arg Asn Pro Asn Val 
305 -310 315 320 

Asn Val Gly lie Leu Ser Arg Arg Phe Leu Gin Asn Pro Leu Met lie 
325 330 335 

Phe His Phe Leu Cys Ala Tyr Gly His Ala Thr Asn Asp Met Lys Thr 
340 345 350 

Asp Tyr Ala Asn Phe Pro Met Val Lys Asn Ser Trp Arg Asn Asn Cys 
355 360 365 

Trp Ala lie Lys Cys Gly Gly Ser Met Pro Leu Leu Val Phe Glu Asn 
370 375 380 

Gly Lys Leu Phe Gin Gly Ala lie Pro Phe Met Lys Leu Gin Leu Val 
385 390 395 400 



(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1830 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GAT CTC ACA TTA GGG AGT CGT GAC AGT TAT AAT GGT GAT ACA AGC ACC 4 8 

Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr 
15 10 15 

ACA GAA TTT ACT CCT AAA GCG GCA ACT TCT GAT GCT AGT GGC ACG ACC 96 
Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr 
20 25 30 

TAT ATT CTC GAT GGG GAT GTC TCG ATA AGC CAA GCA GGG AAA CAA ACG 14 4 

Tyr lie Leu Asp Gly Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr 
35 ■ 40 45 

AGC TTA ACC ACA AGT TGT TTT TCT AAC ACT GCA GGA AAT CTT ACC TTC 192 
Ser Leu Thr Thr. Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe 
50 55 60 

TTA GGG AAC GGA TTT TCT CTT CAT TTT GAC AAT ATT ATT TCG TCT ACT 24 0 

Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn lie lie Ser Ser Thr 
65 70 75 80 

GTT GCA GGT GTT GTT GTT AGC AAT ACA GCA GCT TCT GGG ATT ACG AAA 28 8 

Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys 
85 90 95 

TTC TCA GGA TTT TCA ACT CTT CGG ATG CTT GCA GCT CCT AGG ACC ACA 33 6 

Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr 
100 105 110 

GGT AAA GGA GCC ATT AAA ATT ACC GAT GGT CTG GTG TTT GAG AGT ATA 384 
Gly Lys Gly Ala lie Lys lie Thr Asp Gly Leu Val Phe Glu Ser lie 
115 120 125 

GGG AAT CTT GAT CCG ATT ACT GTA ACA GGA TCG ACA TCT GTT GCT GAT 432 
Gly Asn Leu Asp Pro lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
130 135 140 

GCT CTC AAT ATT AAT AGC CCT GAT ACT GGA GAT AAC AAA GAG TAT ACG 480 
Ala Leu Asn lie Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
145 150 155 160 

GGA ACC ATA GTC TTT TCT GGA GAG AAG CTC ACG GAG GCA GAA GCT AAA 52 8 

Gly Thr lie Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
165 170 175 

GAT GAG AAG AAC CGC ACT TCT AAA TTA CTT CAA AAT GTT GCT TTT AAA 57 6 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
180 185 190 
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AAT GGG ACT GTA GTT TTA AAA GGT GAT GTC GTT TTA AGT GCG AAC GGT 62 4 

Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly 
195 200 205 

TTC TCT CAG GAT GCA AAC TCT AAG TTG ATT ATG GAT TTA GGG ACG TCG 672 

Phe Ser Gin Asp Ala Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser 
210 215 220 

TTG GTT GCA AAC ACC GAA AGT ATC GAG TTA ACG AAT TTG GAA ATT AAT 72 0 

Leu Val Ala Asn Thr Glu Ser lie Glu Leu Thr Asn Leu Glu lie Asn 
225 230 235 240 

ATA GAC TCT CTC AGG AAC GGG AAA AAG ATA AAA CTC AGT GCT GCC ACA 7 68 

lie Asp Ser Leu Arg Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr 
245 250 255 

GCT CAG AAA GAT ATT CGT ATA GAT CGT CCT GTT GTA CTG GCA ATT AGC 816 

Ala Gin Lys Asp lie Arg lie Asp Arg Pro Val Val Leu Ala lie Ser 

260 265 270 

GAT GAG AGT TTT TAT CAA AAT GGC TTT TTG AAT GAG GAC CAT TCC TAT 8 64 

Asp Glu Ser Phe Tyr Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr 
275 280 285 

GAT GGG ATT CTT GAG TTA GAT GCT GGG AAA GAC ATC GTG ATT TCT GCA 912 

Asp Gly lie Leu Glu Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala 
290 295 300 

GAT TCT CGC AGT ATA GAT GCT GTA CAA TCT CCG TAT GGC TAT CAG GGA 9 60 

Asp Ser Arg Ser lie Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly 
305 310 315 320 

AAG TGG ACG ATC AAT TGG TCT ACT GAT GAT AAG AAA GCT ACG GTT TCT 10 08 

Lys Trp Thr lie Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser 
325 330 335 

TGG GCG AAG CAG AGT TTT AAT CCC ACT GCT GAG CAG GAG GCT CCG TTA 1056 

Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 

340 345 350 

GTT CCT AAT CTT CTT TGG GGT TCT TTT ATA GAT GTT CGT TCC TTC CAG 1104 

Val Pro Asn Leu Leu Trp Gly Ser Phe lie Asp Val Arg Ser Phe Gin 
355 360 365 

AAT TTT ATA GAG CTA GGT ACT GAA GGT GCT CCT TAC GAA AAG AGA . TTT 1152 

Asn Phe lie Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
370 375 380 

TGG GTT GCA GGC ATT TCC AAT GTT TTG CAT AGG AGC GGT CGT GAA AAT 12 00 

Trp Val Ala Gly lie Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
385 390 395 400 

CAA AGG AAA TTC CGT CAT GTG AGT GGA GGT GCT GTA GTA GGT GCT AGC . 124 8 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
405 410 415 



ACG AGG ATG CCG GGT GGT GAT ACC TTG TCT CTG GGT TTT GCT CAG CTC 12 9 6 

Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu 
420 425 430 

TTT GCG CGT GAC AAA GAG TAG TTT ATG AAT ACC AAT TTC GCA AAG ACC 134 4 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
435 440 445 

TAG GCA GGA TCT TTA CGT TTG CAG CAC GAT GCT TCC CTA TAC TCT GTG 1392 

Tyr Ala Gly Ser Leu Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val 
450 455 460 

GTG AGT ATC CTT TTA GGA GAG GGA GGA CTC CGC GAG ATC CTG TTG CCT 14 4 0 

Val Ser lie Leu Leu Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro 

465 470 475 480 

TAT GTT TCC AAT ACT CTG CCG TGC TCT TTC TAT GGG CAG CTT AGC TAC 14 8 8 

Tyr Val Ser Asn Thr Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr 
485 490 495 

GGC CAT ACG GAT CAT CGC ATG AAG ACC GAG TCT CTA CCC CCC CCC CCC 153 6 

Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro 
500 505 510 

CCG ACG CTC TCG ACG GAT CAT ACT TCT TGG GGA GGA TAT GTC TGG GCT 1584 

Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala 
515 520 525 

GGA GAG CTG GGA ACT CGA GTT GCT GTT GAA AAT ACC AGC GGC AGA GGA 1632 

Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly 
530 535 540 

TTT TTC CGA GAG TAC ACT CCA TTT GTA AAA GTC CAA GCT GTT TAC TCG 168 0 

Phe Phe Arg Glu Tyr Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ser 

545 550 555 560 

CGC CAA GAT AGC TTT GTT GAA CTA GGA GCT ATC AGT CGT GAT TTT AGT 172 8 

Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
565 570 575 

GAT TCG CAT CTT TAT AAC CTT GCG ATT CCT CTT GGA ATC AAG TTA GAG 17 7 6 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
580 585 590 

AAA CGG TTT GCA GAG CAA TAT TAT CAT GTT GTT GCG ATG TAT TCT CCA 1824 

Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
595 600 605 

GAT GTT 1830 
Asp Val 
610 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 



# .3 • 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Asp Leu Thr Leu Gly Ser Arg Asp Ser Tyr Asn Gly Asp Thr Ser Thr 
15 10 15 

Thr Glu Phe Thr Pro Lys Ala Ala Thr Ser Asp Ala Ser Gly Thr Thr 
20 25 30 

Tyr lie Leu Asp Gly Asp Val Ser lie Ser Gin Ala Gly Lys Gin Thr 
35 40 45 

Ser Leu Thr Thr Ser Cys Phe Ser Asn Thr Ala Gly Asn Leu Thr Phe 
50 55 60 

Leu Gly Asn Gly Phe Ser Leu His Phe Asp Asn lie lie Ser Ser Thr 
65 70 75 80 

Val Ala Gly Val Val Val Ser Asn Thr Ala Ala Ser Gly lie Thr Lys 
85 90 95 

Phe Ser Gly Phe Ser Thr Leu Arg Met Leu Ala Ala Pro Arg Thr Thr 
100 105 110 

Gly Lys Gly Ala lie Lys lie Thr Asp Gly Leu Val Phe Glu Ser lie 
115 120 125 

Gly Asn Leu Asp Pro lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
130 135 140 

Ala Leu Asn lie Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
145 150 155 160 

Gly Thr lie Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
165 170 175 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
180 185 190 

Asn Gly Thr Val Val Leu Lys .Gly Asp Val Val Leu Ser Ala Asn Gly 
195 200 205 

Phe Ser Gin Asp Ala Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser 
210 215 220 

Leu Val Ala Asn Thr Glu Ser lie Glu Leu Thr Asn Leu Glu lie Asn 
225 230 235 240 

lie Asp Ser Leu Arg Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr 
245 250 255 



Ala Gin Lys Asp 
260 

Asp Glu Ser Phe 
275 

Asp Gly lie Leu 
290 

Asp Ser Arg Ser 
305 

Lys Trp Thr lie 



Trp Ala Lys Gin 
340 

Val Pro Asn Leu 
355 

Asn Phe lie Glu 
370 

Trp Val Ala Gly 
385 

Gin Arg Lys Phe 



Thr Arg Met Pro 
420 

Phe Ala Arg Asp 
435 

Tyr Ala Gly Ser 
450 

Val Ser lie Leu 
465 

Tyr Val Ser Asn 



Gly His Thr Asp 
500 

Pro Thr Leu Ser 
515 

Gly Glu Leu Gly 
530 

Phe Phe Arg Glu 
545 



lie Arg lie Asp 



Tyr Gin Asn Gly 
280 

Glu Leu Asp Ala 
295 

lie Asp Ala Val 
310 

Asn Trp Ser Thr 
325 

Ser Phe Asn Pro 



Leu Trp Gly Ser 
360 

Leu Gly Thr Glu 
375 

lie Ser Asn Val 
390 

Arg His Val Ser 
405 

Gly Gly Asp Thr 



Lys Asp Tyr Phe 
440 

Leu Arg Leu Gin 
455 

Leu Gly Glu Gly 
470 

Thr Leu Pro Cys 
485 

His Arg Met Lys 



Thr Asp His Thr 
520 

Thr Arg Val Ala 
535 

Tyr Thr Pro Phe 
550 
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Arg Pro Val Val 
265 

Phe Leu Asn Glu 



Gly Lys Asp lie 
300 

Gin Ser Pro Tyr 
315 

Asp Asp Lys Lys 
330 

Thr Ala Glu Gin 
345 

Phe lie Asp Val 



Gly Ala Pro Tyr 
380 

Leu His Arg Ser 
395 

Gly Gly Ala Val 
410 

Leu Ser Leu Gly 
425 

Met Asn Thr Asn 



His Asp Ala Ser 
460 

Gly Leu Arg Glu 
475 

Ser Phe Tyr Gly 
490 

Thr Glu Ser Leu 
505 

Ser Trp Gly Gly 



Val Glu Asn Thr 
540 

Val Lys Val Gin 
555 



Leu Ala lie Ser 
270 

Asp His Ser Tyr 
285 

Val lie Ser Ala 



Gly Tyr Gin Gly 
320 

Ala Thr Val Ser 

33.5 

Glu Ala Pro Leu 
350 

Arg Ser Phe Gin 
365 

Glu Lys Arg Phe 



Gly Arg Glu Asn 
400 

Val Gly Ala Ser 
415 

Phe Ala Gin Leu 
430 

Phe Ala Lys Thr 
445 

Leu Tyr Ser Val 



lie Leu Leu Pro 
480 

Gin Leu Ser Tyr 
495 

Pro Pro Pro Pro 
510 

Tyr Val Trp Ala 
525 

Ser Gly Arg Gly 



Ala Val Tyr Ser 
560 



Arg Gin Asp Ser 



Asp Ser His Leu 
580 

Lys Arg Phe Ala 
595 



Phe Val Glu Leu 
565 

Tyr Asn Leu Ala 



Glu Gin Tyr Tyr 
600 



Gly Ala lie Ser 
570 

lie Pro Leu Gly 
585 

His Val Val Ala 



Arg Asp Phe Ser 
575 

lie Lys Leu Glu 
590 

Met Tyr Ser Pro 
605 



Asp Val 
610 



